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WO 03/031573 PCTYUS02/31460 
TOLL-LIKE RECEPTOR 3 SIGNALING AGONISTS AND ANTAGONISTS 

Field of the Invention 

The invention pertains to signal transduction by Toll-like receptor 3 (TLR3), 
5 which is believed to be involved in innate immunity. More specifically, the invention 
pertains to screening methods useful for the identification and characterization of TLR3 
ligands, TLR3 signaling agonists, and TLR3 signaling antagonists. 



Background of the Invention 

10 Toll-like receptors (TLRs) are a family of at least ten highly conserved receptor 

proteins (TLR1 - TLR10) which recognize pathogen-associated molecular patterns 
(PAMPs) and act as key elements in innate immunity. As members of the pro- 
inflammatory interleukin-1 receptor (IL-1R) family, TLRs share homologies in their 
cytoplasmic domains called Toll/IL-IR homology (TIR) domains. PCT published 

15 applications PCT/US98/08979 and PCT/US01/16766. Intracellular signaling 

mechanisms mediated by TIRs appear generally similar, with MyD88 (Wesche H et al. 

(1997) Immunity 7:837-47; Medzhitov R et al. (1998) Mol Cell 2:253-8; Adachi O et al. 

(1998) Immunity 9:143-50; Kawai T et al. (1999) Immunity 1 1:1 15-22) and tumor 
necrosis factor receptor-associated factor 6 (TRAF6; C30 Z et al. (1996) Nature 

20 383:443-6; Lomaga MA et al. (1999) Genes Dev 13:1015-24) believed to have critical 
roles. Signal transduction between MyD88 and TRAF6 is known to involve members 
of the serine-threonine kinase IL-1 receptor-associated kinase (IRAK) family, including 
at least IRAK-1 and IRAK-2. Muzio M et al. (1997) Science 278:1612-5, 

Ligands for many but not all of the TLRs have been described. For instance, it 

25 has been reported that TLR2 signals in response to peptidoglycan and lipopeptides. 
Yoshimura A et al. (1999) J Immunol 163:1-5; Brightbill HD et al. (1999) Science 
285:732-6; Aliprantis AO et al. (1999) Science 285:736-9; Takeuchi O et al. (1999) 
Immunity 11:443-51; Underhill DM et al. (1999) Nature 401:811-5. TLR4 has been 
reported to signal in response to lipopolysaccharide (LPS). Hoshino K et al. (1999) J 

30 Immunol 162:3749-52; Poltorak A et al. (1998) Science 282:2085-8; Medzhitov R et al. 
(1997) Nature 388:394-7. Bacterial flagellin has been reported to be a natural ligand 
for TLRS. Hayashi F et al. (2001) Nature 410:1099-1 103. TLR6, in conjunction with 
with TLR2, has been reported to signal in response to proteoglycan. Ozinsky A et al. 
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(2000) PNAS USA 97:13766-71 ; Takeuchi O et al. (2001) Int Immunol 13:933-40. 
Recently it was recently reported that TLR9 is a receptor for CpG DNA. Hemmi H et 
al. (2000) Nature 408:740-5. 

5 Summary of the Invention 

The invention provides screening methods and compositions useful for the 
identification and characterization of compounds which themselves signal through 
Toll-like receptor 3 (TLR3) or which influence signaling through TLR3. Compounds 
which themselves signal through TLR3 are presumptively immunostimulatory. 
10 Compounds which influence signaling through TLR3 include both agonists and 
antagonists of TLR3 signaling activity. The methods provided by the invention are 
adaptable to high throughput screening, thus accelerating the identification and 
characterization of previously unknown inducers, agonists, and antagonists of TLR3 
signaling activity. 

15 The methods of the invention rely at least in part on the ability to assess TLR3 

signaling activity. It has surprisingly been discovered according to the present 
invention that reporter constructs having reporter genes under control of certain 
promoter response elements sensitive to TLR3 signaling activity are useful in the 
screening assays of the invention. For example it has been surprisingly discovered 

20 according to the present invention that a reporter gene under control of interferon- 
specific response element (ISRE) is sensitive to TLR3 signaling activity. 

It has also surprisingly been discovered according to the present invention that 
screening assays for TLR ligands and other assays involving TLR signaling activity can 
benefit from optimization for at least one of the variables of (a) concentration of test 

25 and/or reference compound, (b) kinetics of the assay, and (c) selection of reporter. 
Interpretation of assay data can be influenced by each of these variables. 

In one aspect the invention provides a screening method for identifying an 
immunostimulatory compound. The method according to this aspect of the invention 
involves the steps of (a) contacting a functional TLR3 with a test compound under 

30 conditions which, in absence of the test compound, permit a negative control response 
mediated by a TLR3 signal transduction pathway; (b) detecting a test response 
mediated by the TLR3 signal transduction pathway; and (c) determining the test 
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compound is an immunostimulatory compound when the test response exceeds the 
negative control response. In this and in all aspects of the invention, in one 
embodiment the screening method is performed on a plurality of test compounds. A 
test compound according to this and all aspects of the invention is in one embodiment a 

5 member of a library of compounds, preferably a combinatorial library of compounds. 
Also in this and in all aspects of the invention, a test compound is preferably a small 
molecule, a nucleic acid, a polypeptide, an oligopeptide, or a lipid. In more preferred 
embodiments, the test compound is a small molecule or a nucleic acid. In one 
embodiment a test compound that is a nucleic acid is a CpG nucleic acid. 

10 In another aspect the invention provides a screening method for identifying an 

immunostimulatory compound. The method according to this aspect of the invention 
involves the steps of (a) contacting a functional TLR3 with a test compound under 
conditions which, in presence of a reference immunostimulatory compound, permit a 
reference response mediated by a TLR3 signal transduction pathway; (b) detecting a 

15 test response mediated by the TLR3 signal transduction pathway; and (c) determining 
the test compound is an immunostimulatory compound when the test response equals or 
exceeds the reference response. In this and other aspects of the invention, a reference 
immunostimulatory compound is preferably a small molecule, a nucleic acid, a 
polypeptide, an oligopeptide, or a lipid. In one embodiment the reference 

20 immunostimulatory compound is a CpG nucleic acid. 

In a further aspect the invention provides a screening method for identifying a 
compound that modulates TLR3 signaling activity. The method according to this 
aspect of the invention involves the steps of (a) contacting a functional TLR3 with a 
test compound and a reference immunostimulatory compound under conditions which, 

25 in presence of the reference immunostimulatory compound alone, permit a reference 
response mediated by a TLR3 signal transduction pathway; (b) detecting a test- 
reference response mediated by the TLR3 signal transduction pathway; (c) determining 
the test compound is an agonist of TLR3 signaling activity when the test-reference 

response exceeds the reference response; and (d) determining the test compound is an 

i 

30 antagonist of TLR3 signaling activity when the reference response exceeds the test- 
reference response. 
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In yet another aspect the invention provides a screening method for identifying 
species specificity of an immunostimulatory compound. The method according to this 
aspect of the invention involves the steps of (a) measuring a first species-specific 
response mediated by a TLR3 signal transduction pathway when a functional TLR3 of 
5 a first species is contacted with a test compound; (b) measuring a second species- 
specific response mediated by the TLR3 signal transduction pathway when a functional 
TLR3 of a second species is contacted with the test compound; and (c) comparing the 
first species-specific response with the second species-specific response. In a preferred 
embodiment the functional TLR3 of the first species is a human TLR3. In one 
10 preferred embodiment the functional TLR3 of the first species is a human TLR3 and 
the functional TLR3 of the second species is a mouse TLR3. 

In preferred embodiments of the foregoing aspects of the invention, the 
response mediated by the TLR3 signal transduction pathway is measured 
quantitatively. 

15 Also in preferred embodiments of the foregoing aspects of the invention, the 

functional TLR3 is expressed in a cell. For example, in one embodiment the cell is an 
isolated mammalian cell that naturally expresses the functional TLR3. Alternatively, in 
another embodiment the cell is an isolated mammalian cell that does not naturally 
express the functional TLR3, wherein the cell has an expression vector for TLR3. For 

20 example, in one preferred embodiment the cell is a human 293 fibroblast. In other 

. i 

embodiments, the functional TLR3 is part of a cell-freeisystem. 

Particularly useful in embodiments of the invention involving cells which 
express functional TLR3 are cells which include a reporter construct sensitive to TLR3 
signaling. In one embodiment the cell includes an expression vector having an isolated 

25 nucleic acid which encodes a reporter construct selected from the group of nuclear 
factor-kappa B-luciferase (NF-kB-1uc), DFN-specific response element-luciferase 
(ISRE-luc), interleukin-6-luciferase (IL-6-luc), interleukin 8-luciferase (IL-8-luc), 
interleukin 12 p40 subunit-luciferase (IL-12 p40-luc), interleukin 12 p40 subunit-beta 
galactosidase (IL-12 p40-p-Gal), activator protein 1-luciferase (APl-luc), interferon 

30 alpha-luciferase (DFN-a-luc), interferon beta-luciferase (IFN-P-luc), RANTES- 

luciferase (RANTES-luc), tumor necrosis factor-luciferase (TNF-luc), EP-10-luciferase 
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(IP-10-luc), and interferon-inducible T cell alpha chemoattractant-luciferase (I-TAC- 
luc). In a preferred embodiment the reporter construct is ISRE-luc. 

In one embodiment according to each of the foregoing aspects of the invention, 
the functional TLR3 is part of a complex with a non-TLR protein selected from the 

5 group consisting of MyD88, IL-1 receptor associated kinase 1-3 (IRAKI, IRAK2, 
IRAK3), tumor necrosis factor receptor-associated factor 1-6 (TRAF1 - TRAF6), IkB, 
NF-kB, MyD88-adapter-like (Mai), Toll-interleukin 1 receptor (TIR) domain- 
containing adapter protein (TIRAP), Tollip, Rac, and functional homologues and 
derivatives thereof. In a related embodiment functional TLR3 is part of a complex with 

JO a non-TLR protein listed above, excluding MyD88. 

Also according to each of the foregoing aspects of the invention, in one 
embodiment the response mediated by a TLR3 signal transduction pathway is induction 
of a reporter gene under control of a promoter response element selected from the 
group consisting of ISRE, IL-6, IL-8, IL-12 p40, IFN-a, IFN-p, IFN-co, RANTES, 

15 TNF, IP-10, and I-TAC. For example, in a preferred embodiment the reporter gene 
under control of a promoter response element is selected from the group consisting of 
ISRE-luc, EL-6-luc, IL-8-luc, IL-12 p40-luc, IL-12 p40-P-Gal, BFN-a-luc, IFN-P-luc, 
RANTES-luc, TNF-luc, IP-10-luc, and I-TAC-luc. In one preferred embodiment the 
reporter gene under control of a promoter response element is ISRE-luc. In yet another 

20 preferred embodiment the reporter gene is selected from the group consisting of DFN- 
al-luc and IFN-a4-luc. 

In yet another embodiment according to each of the foregoing aspects of the 
invention, the response mediated by a TLR3 signal transduction pathway is selected 
from the group consisting of (a) induction of a reporter gene under control of a minimal 

25 promoter responsive to a transcription factor selected from the group consisting of API, 
NF-kB, ATF2, IRF3, and IRF7; (b) secretion of a chemokine; and (c) secretion of a 
cytokine. For example, in one preferred embodiment the response mediated by a TLR3 
signal transduction pathway is induction of a reporter gene selected from the group 
consisting of APl-luc and NF-kB-1uc. In another preferred embodiment the response 

30 mediated by a TLR3 signal transduction pathway is secretion of a type 1 IFN. In yet 
another preferred embodiment the response mediated by a TLR3 signal transduction 
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pathway is secretion of a chemokine selected from the group consisting of CCL5 
(RANTES), CXCL9 (Mig), CXCL10 (IP-10), and CXCL1 1 (MAC). 

The sensitivity and interpretation of the screening methods of the present 
invention can be optimized. Such optimization involves proper selection of any one or 

5 combination of (a) concentration of test and/or reference compound, (b) kinetics of the 
assay, and (c) reporter. Thus, further according to each of the first three aspects of the 
invention, in one embodiment the contacting a functional TLR3 with a test compound 
further entails, for each test compound, contacting with the test compound at each of a 
plurality of concentrations. For example, each test compound may be evaluated at 

10 various concentrations which differ by log increments. Also according to each of the 
foregoing aspects of the invention, in one embodiment the detecting is performed 4-12 
hours, preferably 6-8 hours, following the contacting. Similarly, in yet another 
embodiment according to each of the foregoing aspects of the invention, the detecting 
is performed 16-24 hours following the contacting. Detecting performed 4-12 hours, 

75 preferably 6-8 hours, following the contacting is believed to be more sensitive to 

affinity of interaction than is detecting at later times. Detecting performed 1 6-24 hours 
or later following the contacting is believed to be more sensitive to stability and 
duration of receptor/ligand interaction. Furthermore, because certain reporter 
constructs are more sensitive to certain TLRs than others, proper matching of reporter 

20 to TLR assay is important to increase signal-to-noise ratio in the readout of a particular 
assay. 

Brief Description of the Figures 

This application includes examples which refer to figures or other drawings. It 
25 is to be understood that the referenced figures are illustrative only and are not essential 
to the enablement of the claimed invention. 

Figure 1 is two paired bar graphs showing (A) the induction of NF-kB and (B) 
the amount of IL-8 produced by 293 fibroblast cells transfected with human TLR9 in 
response to exposure to various stimuli, including CpG-ODN, GpC-ODN, LPS, and 
30 medium. 

Figure 2 is a bar graph showing the induction of NF-kB produced by 293 

i 

fibroblast cells transfected with murine TLR9 in response to exposure to various 
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stimuli, including CpG-ODN, methylated CpG-ODN (Me-CpG-ODN), GpC-ODN, 
LPS, and medium. 

Figure 3 is a series of gel images depicting the results of reverse transcriptase- 
polymerase chain reaction (RT-PCR) assays for murine TLR9 (mTLR9), human TLR9 
5 (hTLR9), and glyceraldehyde-3-phosphate dehydrogenase (GAPDH) in untransfected 
control 293 cells, 293 cells transfected with mTLR9 (293-mTLR9), and 293 cells 
transfected with hTLR9 (293-MTLR9). 

Figure 4 is a graph showing the degree of induction of NF-kB-1uc by various 
stimuli in stably transfected 293-hTLR9 cells. 1 
10 Figure 5 is a graph showing the degree of induction of NF-kB-1uc by various 

stimuli in stably transfected 293-mTLR9 cells. 

Figure 6 is a graph showing fold induction of response as a function of 
concentration for a series of four related immunostimulatory nucleic acids contacted 
with human 293 fibroblast cells stably transfected with murine TLR9 and NF-kB-1uc. 
15 Concentrations listed correspond to EC50 for each ligand. 

Figure 7 is a graph showing kinetics of EC50 determinations for a series of five 
immunostimulatory nucleic acids contacted with human 293 fibroblast cells stably 
transfected with murine TLR9 and NF-kB-1uc. 

Figure 8 is a graph showing kinetics of ECS 0 determinations for the same series 
20 of five immunostimulatory nucleic acids as in Figure 7 contacted with human 293 
fibroblast cells stably transfected with human TLR9 and NF-kB-1uc. 

Figure 9 is a graph showing kinetics of maximal activity (fold induction of 
response) for the same series of five immunostimulatory nucleic acids as in Figure 7 
contacted with human 293 fibroblast cells stably transfected with murine TLR9 and 
25 NF-kB-1uc. 

Figure 10 is a graph showing kinetics of maximal activity (fold induction of 
response) for the same series of five immunostimulatory nucleic acids as in Figure 7 
contacted with human 293 fibroblast cells stably transfected with human TLR9 and NF- 
kB-1uc. 

30 Figure 1 1 is a bar graph showing fold induction of response as measured using 

various luciferase reporter constructs (NF-kB-1uc, IP-10-luc, RANTES-luc, ISRE-luc, 
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and IL-8-luc) in combination with TLR7, TLR8, and TLR9, each TLR contacted with a 
specific reference TLR ligand. 



Detailed Description of the Invention 

5 The invention in certain aspects provides screening methods useful for the 

identification, characterization, and optimization of immunostimulatory compounds, 
including but not limited to immunostimulatory nucleic acids and immunostimulatory 
small molecules, as well as assays for the identification and optimization of agonists 
and antagonists of TLR3 signaling. The methods according to the invention include 

10 both cell-based and cell-free assays. In certain preferred embodiments the screening 
methods are performed in a high throughput manner. The methods can be used to 
screen libraries of compounds for their ability to modulate immune activation that 
involves TLR3 signaling. 

In one aspect the invention provides a screening method for identifying an 

15 immunostimulatory compound. The method according to this aspect of the invention 
involves the steps of (a) contacting a functional TLR3 with a test compound under 
conditions which, in absence of the test compound, permit a negative control response 
mediated by a TLR3 signal transduction pathway; (b) detecting a test response 
mediated by the TLR3 signal transduction pathway; and (c) determining the test 

20 compound is an immunostimulatory compound when the test response exceeds the 
negative control response. In a second aspect the invention provides a screening 
method for identifying an immunostimulatory compound. The method according to 
this aspect of the invention involves the steps of (a) contacting a functional TLR3 with 
a test compound under conditions which, in presence of a reference immunostimulatory 

25 compound, permit a reference response mediated by a TLR3 signal transduction 
pathway; (b) detecting a test response mediated by the TLR3 signal transduction 
pathway; and (c) determining the test compound is an immunostimulatory compound 
when the test response equals or exceeds the reference response. It will be appreciated 
that these two aspects of the invention differ in that one involves comparison of the test 

30 compound against a negative control and the other involves comparison of the test 
compound against a positive control. 
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For these and other aspects of the invention, the TLR3 is preferably a 
mammalian TLR3, such as human TLR3 or mouse TLR3. Nucleotide and amino acid 
sequences for human TLR3 and murine TLR3 have previously been described. The 
nucleotide sequence for human TLR3 cDNA can be found as GenBank accession no. 

5 NMJ)03265 (SEQ ID NO: 1), and the deduced amino acid sequence for human TLR3, 
encompassing 904 amino acids, can be found as GenBank accession nos NPJ)03256 
(SEQ ID NO:2). The nucleotide sequence for murine TLR3 cDNA can be found as 
GenBank accession no. AF355152 (SEQ ID NO:3), and the deduced amino acid 
sequence for murine TLR3, encompassing 905 amino acids, can be found as GenBank 

10 accession no. AAK261 17 (SEQ ID NO:4). 

As used herein, a "functional TLR3" shall refer to a polypeptide, including a 
full length naturally occurring TLR3 polypeptide as described above, which 
specifically binds a TLR3 ligand and signals via a Toll/interleukin-1 receptor (TIR) 
domain. In addition to full length naturally occurring TLR3, a functional TLR3 thus 

15 also refers to allelic variants, fusion proteins, and truncated versions of the same, 
provided the polypeptide specifically binds a TLR3 ligand and signals via a TIR 
domain. In a preferred embodiment, the functional TLR3 includes a human TLR3 
extracellular domain having an amino acid sequence provided by amino acids 38-707 
according to SEQ ID NO:2. In another preferred embodiment, the functional TLR3 

20 includes a murine TLR3 extracellular domain having an amino acid sequence provided 
by amino acids 39-708 according to SEQ ID NO:4. Preferably, the functional TLR3 
signals through a TIR domain of TLR3. 

In certain embodiments of this and other aspects of the invention, the functional 
TLR3 is expressed, either naturally or artifically, in a cell. In some embodiments, a cell 

25 expressing TLR3 for use in the methods of the invention expresses TLR3 and no other 
TLR. Alternatively, in some embodiments a cell expressing TLR3 for use in the 
methods of the invention expresses both TLR3 and at least one other TLR, e.g., TLR7, 
TLR8, or TLR9. In one embodiment the cell is an isolated mammalian cell that 
naturally expresses functional TLR3. Cells and tissues known to express TLR3 include 

30 dendritic cells (DCs), intraepithelial cells, and placenta. Muzio M et al. (2000) J 

Immunol 164:5998-6004; Cario E et al. (2000) Infect Immun 68:7010-7; Rock FL et al. 
(1998) Proc Natl Acad Sci USA 95:588-93. The term "isolated" as used herein, with 
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reference to a cell or to a compound, means substantially free of or separated from 
components with which the cell or compound is normally associated in nature, e.g., 
other cells, nucleic acids, proteins, lipids, carbohydrates or in vivo systems to an extent 
practical and appropriate for its intended use. 

5 In another embodiment the cell can be one that, as it occurs in nature, is not 

capable of expressing TLR3 but which is rendered capable of expressing TLR3 through 
the artificial introduction of an expression vector for TLR3. Examples of cell lines 
lacking TLR3 include, but are not limited to, human 293 fibroblasts (ATCC CRL-1573) 
and HEp-2 human epithelial cells (ATCC CCL-23). Examples of cell lines lacking 

10 TLR9 include, but are not limited to, human 293 fibroblasts (ATCC CRL-1573), 
MonoMac-6, THP-1, U937, CHO, and any TLR9 knock-out. Typically the cell, 
whether it is capable of expressing TLR3 naturally or artificially, preferably has all the 
necessary elements for signal transduction initiated through the the TLR3 receptor. For 
example, it is believed that TLR9 signaling requires the adapter protein MyD88 in an 

15 early step of signal transduction. In contrast, TLR3 appears not to require MyD88 but 
may require other factors further downstream, e.g., factors that induce mitogen- 
activated protein kinase (MAPK) and factors downstream of MAPK. 

When indicated, introduction of a particular TLR into a cell or cell line is 
preferably accomplished by transient or stable transfection of the cell or cell line with a 

20 TLR-encoding nucleic acid sequence operatively linked to a gene expression sequence 
(as described herein). For example, a cell artificially induced to express TLR3 for use 
in the methods of the invention includes a cell that has been transiently or stably 
transfected with a TLR3 expression vector. Any suitable method of transient or stable 
transfection can be employed for this purpose. 

25 An expression vector for TLR3 will include at least a nucleotide sequence 

coding for a functional TLR3 polypeptide, operably linked to a gene expression 
sequence which can direct the expression of the TLR3 nucleic acid within a eukaryotic 
or prokaryotic cell. A "gene expression sequence" is any regulatory nucleotide 
sequence, such as a promoter sequence or promoter-enhancer combination, which 

30 facilitates the efficient transcription and translation of the nucleic acid to which it is 
operably linked. With respect to TLR3 nucleic acid, the "gene expression sequence" is 
any regulatory nucleotide sequence, such as a promoter sequence or promoter-enhancer 
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combination, which facilitates the efficient transcription and translation of the TLR3 
nucleic acid to which it is operably linked. The gene expression sequence may, for 
example, be a mammalian or viral promoter, such as a constitutive or inducible 
promoter. Constitutive mammalian promoters include, but are not limited to, the 
5 promoters for the following genes: hypoxanthine phosphoribosyl transferase (HPRT), 
adenosine deaminase, pyruvate kinase, p-actin promoter, and other constitutive 
promoters. Exemplary viral promoters which function constitutively in eukaryotic cells 
include, for example, promoters from the simian virus (e.g., SV40), papillomavirus, 
adenovirus, human immunodeficiency virus (HIV), Rous sarcoma virus (RSV), 

10 cytomegalovirus (CMV), the long terminal repeats (LTR) of Moloney murine leukemia 
virus and other retroviruses, and the thymidine kinase (TK) promoter of herpes simplex 
virus. Other constitutive promoters are known to those of ordinary skill in the art. The 
promoters useful as gene expression sequences of the invention also include inducible 
promoters. Inducible promoters are expressed in the presence of an inducing agent. 

15 For example, the metallothionein (MT) promoter is induced to promote transcription 
and translation in the presence of certain metal ions. Other inducible promoters are 
known to those of ordinary skill in the art. 

In general, the gene expression sequence shall include, as necessary, 5 f non- 
transcribing and 5' non-translating sequences involved with the initiation of 

20 transcription and translation, respectively, such as a TATA box, capping sequence, 
CAAT sequence, and the like. Especially, such 5 f non-transcribing sequences will 
include a promoter region which includes a promoter sequence for transcriptional 
control of the operably joined TLR3 nucleic acid. The gene expression sequences 
optionally include enhancer sequences or upstream activator sequences as desired. 

25 Generally a nucleic acid coding sequence and a gene expression sequence are 

said to be "operably linked" when they are covalently linked in such a way as to place 
the transcription and/or translation of the nucleic acid coding sequence under the 
influence or control of the gene expression sequence. Thus the TLR3 nucleic acid 
sequence and the gene expression sequence are said to be "operably linked" when they 

30 are covalently linked in such a way as to place the transcription and/or translation of the 
TLR3 coding sequence under the influence or control of the gene expression sequence. 
If it is desired that the TLR3 sequence be translated into a functional protein, two DNA 
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sequences are said to be operably linked if induction of a promoter in the 5' gene 
expression sequence results in the transcription of the TLR3 sequence and if the nature 
of the linkage between the two DNA sequences does not (1) result in the introduction 
of a frame-shift mutation, (2) interfere with the ability of the promoter region to direct 
5 the transcription of the TLR3 sequence, or (3) interfere: with the ability of the 

corresponding RNA transcript to be translated into a protein. Thus, a gene expression 
sequence would be operably linked to a TLR3 nucleic acid sequence if the gene 
expression sequence were capable of effecting transcription of that TLR3 nucleic acid 
sequence such that the resulting transcript might be translated into the desired protein 

10 or polypeptide. 

In certain embodiments a TLR expression vector is constructed so as to permit 
tandem expression of two distinct TLRs, e.g., both TLR3 and a second TLR. Such a 
tandem expression vector can be used when it is desired to express two TLRs using a 
single transformation or transfection. Alternatively, a TLR3 expression vector can be 

15 used in conjunction with a second expression vector constructed so as to permit 
expression of a second TLR. 

The screening assays can have any of a number of possible readout systems 
based upon a TLR/IL-1R signal transduction pathway. In preferred embodiments, the 
readout for the screening assay is based on the use of native genes or, alternatively, 

20 transfected or otherwise artificially introduced reporter gene constructs which are 
responsive to the TLR/IL-1R signal transduction pathway involving MyD88, TRAF, 
p38, and/or ERK. Hacker H et al. (1999) EMBO J 1 8:6973-82. These pathways 
activate kinases including kB kinase complex and c-Jun N-terminal kinases. Thus 
reporter genes and reporter gene constructs particularly useful for the assays include, 

25 e.g., a reporter gene operatively linked to a promoter sensitive to NF-kB. Examples of 
such promoters include, without limitation, those for NF-kB, IL-ip, EL-6, IL-8, EL- 12 
p40, CD80, CD86, and TNF-a. The reporter gene operatively linked to the TLR- 
sensitive promoter can include, without limitation, an enzyme (e.g., luciferase, alkaline 
phosphatase, (J-galactosidase, chloramphenicol acetyltransferase (GAT), etc.), a 

30 bioluminescence marker (e.g., green-fluorescent protein (GFP, U.S. patent 5,491,084), 
etc.), a surface-expressed molecule (e.g., CD25), and a secreted molecule (e.g., IL-8, 
IL-12 p40, TNF-a). In certain preferred embodiments the reporter is selected from IL- 
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8, TNF-a, NF-KB-luciferase (NF-kB-Iuc; Hacker H et al. (1999) EMBO J 18:6973-82), 
IL-12 p40-luc (Murphy TL et al. (1 995) Mol Cell Biol 1 5:5258-67), and TNF-luc 
(Hacker H et al. (1999) EMBO J 1 8:6973-82). In assays relying on enzyme activity 
readout, substrate can be supplied as part of the assay, and detection can involve 

5 measurement of chemiluminescence, fluorescence, color development, incorporation of 
radioactive label, drug resistance, or other marker of enzyme activity. For assays 
relying on surface expression of a molecule, detection can be accomplished using flow 
cytometry (FACS) analysis or functional assays. Secreted molecules can be assayed 
using enzyme-linked immunosorbent assay (ELIS A) or bioassays. These and other 

10 suitable readout systems are well known in the art and are commercially available. 

Thus a cell expressing a functional TLR3 and useful for the methods of the 
invention has, in some embodiments, an expression vector comprising an isolated 
nucleic acid which encodes a reporter construct useful for detecting TLR signaling. 
The expression vector comprising an isolated nucleic acid which encodes a reporter 

15 construct useful for detecting TLR signaling can include a reporter gene under control 
of a minimal promoter responsive to a transcription factor believed by the applicant to 
be activated as a consequence of TLR3 signaling. Examples of such minimal 
promoters include, without limitation, promoters for the following genes: API, NF-kB, 
ATF2, IRF3, and IRF7. In other embodiments the expression vector comprising an 

20 isolated nucleic acid which encodes a reporter construct useful for detecting TLR 
signaling can include a gene under control of a promoter response element selected 
from IL-6, IL-8, IL-12 p40 subunit, a type 1 IFN, RANTES, TNF, IP-10, 1-TAC, and 
ISRE. The promoter response element generally will be present in multiple copies, 
e.g., as tandem repeats. For example, an ISRE-luciferase reporter construct useful in 

25 the invention is available from Stratagene (catalog no. 219092) and includes a 5x ISRE 
tandem repeat joined to a TATA box upstream of a luciferase reporter gene. As 
discussed further elsewhere herein, the reporter itself can be any gene product suitable 
for detection by methods recognized in the art. Such methods for detection can include, 
for example, measurement of spontaneous or stimulated light emission, enzyme 

30 activity, expression of a soluble molecule, expression of a cell surface molecule, etc. 

As mentioned above, the functional TLR3 is contacted with a test compound in 
order to identify an immunostimulatory compound. An immunostimulatory compound 
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is a natural or synthetic compound that is capable of inducing an immune response 

i 

when contacted with an immune cell. In the context of the methods of the invention, an 
immunostimulatory compound refers to a natural or synthetic compound that is capable 
of inducing an immune response when contacted with an immune cell expressing a 

5 functional TLR3 polypeptide. Preferably the immune response is or involves activation 
of a TLR3 signal transduction pathway. Thus immunostimulatory compounds 
identified and characterized using the methods of the invention specifically include 
TLR3 ligands, i.e., compounds which selectively bind to TLR3 and induce a TLR3 
signal transduction pathway. Immunostimulatory compounds in general include but are 

10 not limited to nucleic acids, including oligonucleotides and polynucleotides; 

oligopeptides; polypeptides; lipids, including lipopolysaccharides; carbohydrates, 
including oligosaccharides and polysaccharides; and small molecules. Accordingly, a 
"test compound" refers to nucleic acids, including oligonucleotides and 
polynucleotides; oligopeptides; polypeptides; lipids, including lipopolysaccharides; 

15 carbohydrates, including oligosaccharides and polysaccharides; and small molecules. 
Test compounds include compounds with known biological activity as well as 
compounds without known biological activity. 

A "reference immunostimulatory compound" refers to an immunostimulatory 
compound that characteristically induces an immune response when contacted with an 

20 immune cell expressing a functional TLR polypeptide. In the screening methods of the 
invention, the reference immunositmulatory compound is a natural or synthetic 
compound that that characteristically induces an immune response when contacted with 
an immune cell expressing a functional TLR3 polypeptide. Preferably the immune 
response is or involves activation of a TLR3 signal transduction pathway. Thus a 

25 reference immunostimulatory compound will characteristically induce a reference 
response mediated by a TLR3 signal transduction pathway when contacted with a 
functional TLR3 under suitable conditions. The reference response can be measured 
according to any of the methods described herein. Importantly, a reference 
immunostimulatory compound specifically includes a test compound identified as an 

30 immunostimulatory compound according to any one of the methods of the invention. 
Therefore a reference immunostimulatory compound can be a nucleic acid, including 
oligonucleotides and polynucleotides; an oligopeptide; a polypeptide; a lipid, including 
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lipopolysaccharides; a carbohydrate, including oligosaccharides and polysaccharides; 
or a small molecule. 

Small molecules include naturally occurring, synthetic, and semisynthetic 
organic and organometallic compounds with molecular weight less than about 1.5 kDa. 

5 Examples of small molecules include most drugs, subunits of polymeric materials, and 
analogs and derivatives thereof. 

A "nucleic acid" as used herein with respect to test compounds and reference 
compounds used in the methods of the invention, shall refer to any polymer of two or 
more individual nucleoside or nucleotide units. Typically individual nucleoside or 

JO nucleotide units will include any one or combination of deoxyribonucleosides, 

ribonucleosides, deoxyribonucleotides, and ribonucleotides. The individual nucleotide 
or nucleoside units of the nucleic acid can be naturally occurring or not naturally 
occurring. For example, the individual nucleotide units can include deoxyadenosine, 
deoxycytidine, deoxyguanosine, thymidine, and uracil. In addition to naturally 

15 occurring 2 , -deoxy and 2-hydroxyl forms, individual nucleosides also include synthetic 
nucleosides having modified base moieties and/or modified sugar moieties, e.g., as 
described in Uhlmann E et al. (1990) Chem Rev 90:543-84. The linkages between 
individual nucleotide or nucleoside units can be naturally occurring or not naturally 
occurring. For example, the linkages can be phosphodiester, phosphorothioate, 

20 phosphorodithioate, phosphoramidate, as well as peptide linkages and other covalent 
linkages, known in the art, suitable for joining adjacent nucleoside or nucleotide units. 
The nucleic acid test compounds and nucleic acid reference compounds typically range 
in size from 3-4 units to a few tens of units, e.g., 18-40 units. 

The substituted purines and pyrimidines of the ISNAs include standard purines 

25 and pyrimidines such as cytosine as well as base analogs such as C-5 propyne 

substituted bases. Wagner RW et al. (1996) Nat Biotechnol 14:840-4. Purines and 
pyrimidines include but are not limited to adenine, cytosine, guanine, thymine, 5- 
methylcytosine, 2-aminopurine, 2-amino-6-chloropurine, 2,6-diaminopurine, 
hypoxanthine, and other naturally and non-naturally occurring nucleobases, substituted 

30 and unsubstituted aromatic moieties. 

Libraries of compounds that can be used as test compounds are available from 
various commercial suppliers, and they can be made to order using techniques well 

-15- 

1 



WO 03/031573 



PCT/US02/31460 



known in the art, including combinatorial chemistry techniques. Especially in 
combination with high throughput screening methods, such methods including in 
particular automated multichannel methods of screening, large libraries of test 
compounds can be screened according to the methods of the invention. Large libraries 
5 can include hundreds, thousands, tens of thousands, hundreds of thousands, and even 
millions of compounds. 

Thus in preferred embodiments, the methods for screening test compounds can 
be performed on a large scale and with high throughput by incorporating, e.g., an array- 
based assay system and at least one automated or semi-automated step. For example, 

JO the assays can be set up using multiple-well plates in which cells are dispensed in 
individual wells and reagents are added in a systematic manner using a multiwell 
delivery device suited to the geometry of the multiwell plate. Manual and robotic 
multiwell delivery devices suitable for use in a high throughput screening assay are 
well known by those skilled in the art. Each well or array element can be mapped in a 

15 one-to-one manner to a particular test condition, such as the test compound. Readouts 
can also be performed in this multiwell array, preferably using a multiwell plate reader 
device or the like. Examples of such devices are well known in the art and are 
available through commercial sources. Sample and reagent handling can be automated 
to further enhance the throughput capacity of the screening assay, such that dozens, 

20 hundreds, thousands, or even millions of parallel assays can be performed in a day or in 
a week. Fully robotic systems are known in the art for applications such as generation 
and analysis of combinatorial libraries of synthetic compounds. See, for example, U.S. 
patents 5,443,791 and 5,708,158. 

A "CpG nucleic acid" or a "CpG immunostimulatory nucleic acid" as used 

25 herein is a nucleic acid containing at least one unmethylated CpG dinucleotide 
(cytosine-guanine dinucleotide sequence, i.e. "CpG DNA" or DNA containing a 5' 
cytosine followed by 3' guanine and linked by a phosphate bond) and activates a 
component of the immune system. The entire CpG nucleic acid can be unmethylated or 
portions may be unmethylated but at least the C of the 5 r CG 3 ? must be unmethylated. 

30 In one embodiment a CpG nucleic acid is represented by at least the formula: 

5 , -NiXiCGX 2 N 2 -3' 
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wherein Xi and X 2 are nucleotides, N is any nucleotide, and Ni and N2 are nucleic acid 
sequences composed of from about 0-25 N's each. In some embodiments Xj is 
adenine, guanine, or thymine and/or X 2 is cytosine, adenine, or thymine. In other 
embodiments X] is cytosine and/or X2 is guanine. 
5 Examples of CpG nucleic acids according to the invention include but are not 

limited to those listed in Table 1. 

Table 1. Exemplary CpG Nucleic Acids 

AACGTTCT 

10 AAGCGAAAATGAAATTGACT SEQ ID NO:39 

ACCATGGACGAACTGTTTCCCCTC SEQ ID NO:40 

ACCATGGACGACCTGTTTCCCCTC SEQ ID NO:41 

ACCATGGACGAGCTGTTTCCCCTC SEQ ID NO:42 

ACCATGGACGATCTGTTTCCCCTC SEQ ID NO:43 

15 ACCATGGACGGTCTGTTTCCCCTC SEQ ID NO:44 

ACCATGGACGTACTGTTTC CCCTC SEQ ID NO:45 

ACCATGGACGTTCTGTTTCCCCTC SEQ ID NO:46 

AGCGGGGGCGAGCGGGGGCG SEQ ID NO:47 

AGCTATGACGTTCCAAGG SEQ ID NO:48 

20 ATCGACTCTCGAGCGTTCTC SEQ ID N<D:49 

ATGACGTTCCTGACGTT SEQ ID NO:50 

ATGGAAGGTCCAACGTTCTC SEQ ID NO:5 1 

ATGGAAGGTCCAGCGTTCTC SEQ ID NO:52 

ATGGACTCTCCAGCGTTCTC SEQ ID NO:53 

25 ATGGAGGCTCCATCGTTCTC SEQ ID NO:54 
CAACGTT 

CACGTTGAGGGGCAT SEQ ID NO:55 

CAGGCATAACGGTTCCGTAG SEQ ID NO:56 
CCAACGTT 

30 CTGATTTCCCCGAAATGATG SEQ ID NO:57 

GAGAACGATGGACCTTCCAT SEQ ED NO:58 

GAGAACGCTCCAGCACTGAT SEQ ED NO:59 

GAGAACGCTCGACCTTCCAT SEQ ID NO:60 

GAGAACGCTCGACCTTCGAT SEQ ID NO:6 1 

35 GAGAACGCTGGACCTTCCAT SEQ ID NO:62 

GATTGCCTGACGTCAGAGAG SEQ ID NO:63 

GCATGACGTTGAGCT SEQ ID NO:64 

GCGGCGGGCGG CGCGCG CCC SEQ ID NO:65 

GCGTGCGTTGTCGTTGTCGTT SEQ ID NO:66 

40 GCTAGACGTTAGCGT SEQ ID NO:67 

GCTAGACGTTAGTGT SEQ ID N©:68 

GCTAGATGTTAGCGT SEQ ID NO: 69 

GCTTG ATG ACTC AG CCGGAA SEQ ID NO:70 

GGAATGACGTTCCCTGTG SEQ ID NO:71 
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GGGGTCAACGTTGACGGGG 

GGGGTCAGTCTTGACGGGG 

GTCCATTTCCCGTAAATCTT 
GTCGCT 
5 GTCGTT 

TACCGCGTGCGACCCTCT 
TCAACGTC 
TCAACGTT 
TCAGCGCT 

10 TCAGCGTGCGCC 
TCATCGAT 

TCCACGACGTTTTCGACGTT 
TCCATAACGTTCCTGATGCT 
TCCATAGCGTTCCTAGCGTT 

15 TCCATCACGTGCCTGATGCT 
TCCATGACGGTCCTGATGCT 
TCCATGACGTCCCTGATGCT 
TCCATGACGTGCCTGATGCT 
TCCATGACGTTCCTGACGTT 

20 TCCATGACGTTCCTGATGCT 
TCCATGCCGGTCCTGATGCT 
TCCATGCGTGCGTGCGTTTT 
TCCATGCGTTGCGTTGCGTT 
TCCATGGCGGTCCTGATGCT 

25 TCCATGTCGATCCTGATGCT 
TCCATGTCGCTCCTGATGCT 
TCCATGTCGGTCCTGATGCT 
TCCATGTCGGTCCTGCTGAT 
TCCATGTCGTCCCTGATGCT 

30 TCCATGTCGTTCCTGATGCT 
TCCATGTCGTTCCTGTCGTT 
TCCATGTCGTTTTTGTCGTT 
TCCTGACGTTCCTGACGTT 
TCCTGTCGTTCCTGTCGTT 

35 TCCTGTCGTTCCTTGTCGTT 
TCCTGTCGTTTTTTGTCGTT 
TCCTTGTCGTTCCTGTCGTT 
TCGATCGGGGCGGGGCGAGC 
TCGTCGCTGTCTCCGCTTCTT 

40 TCGTCGCTGTCTCCGCTTCTTCTTGCC 
TCGTCGCTGTCTGCCCTTCTT 
TCGTCGCTGTTGTCGTTTCTT 
TCGTCGTCGTCGTT 
TCGTCGTTGTCGTTGTCGTT 

45 TCGTCGTTGTCGTTTTGTCGTT 

TCGTCGTTTTGTCGTTTTGTCGTT 

TCTCCCAGCGCGCGCCAT 

TCTCCCAGCGGGCGCAT 



SEQIDNO:72 
SEQIDNO:73 
SEQIDNO:74 



SEQIDNO:75 



SEQIDNO:76 

SEQ ID N0:77 

SEQ ID NO:78 

SEQ ID NO:79 

SEQIDNO:80 

SEQIDNO:81 

SEQIDNO:82 

SEQBDNO:83 

SEQ ID NO:84 

SEQIDNO:18 

SEQIDNO:85 

SEQIDNO:86 

SEQIDNO:87 

SEQDDNO:88 

SEQIDNO:89 

SEQIDNO:90 

SEQIDNO:91 

SEQIDNO:92' 

SEQIDNO:93 

SEQ ID NO:94 

SEQIDNO:95 

SEQE)NO:96 

SEQIDNO:97 

SEQIDNO:98 

SEQIDNO:99 

SEQDDNO:100 

SEQIDNO:101 

SEQEDNO:102 

SEQIDNO:103 

SEQIDNO:104 

SEQIDNO:105 

SEQBDNO:106 

SEQIDNO:107 

SEQ ID NO: 108 

SEQ ID NO: 109 

SEQ ID NO: 15 

SEQIDNO:110 

SEQIDNO:lll 
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TCTCCCAGCGTGCGCCAT 
TCTTCGAA 

TGCAGATTGCGCAATCTGCA 

TGTCGCT 

TGTCGTT 

TGTCGTTGTCGTT 

TGTCGTTGTCGTTGTCGTT 



SEQIDNO:112 



SEQIDNO:113 



TGTCGTTTGTCGTTTGTCGTT 



TGTCGTTGTCGTTGTCGTTGTCGTT 



SEQIDNO:114 
SEQIDNO:115 
SEQDDNO:116 
SEQIDNO:117 



10 



15 



20 



25 



30 



As used herein the term "response mediated by a TLR signal transduction 
pathway" refers to a response which is characteristic of an interaction between a TLR 
and an immunostimulatory compound that induces signaling events through the TLR. 
Such responses typically involve usual elements of Toll/IL-IR signaling, e.g., MyD88, 
TRAF, and IRAK molecules, although in the case of TLR3 the role of MyD88 is less 
clear than for other TLR family members. As demonstrated herein such responses 
include the induction of a gene under control of a specific promoter such as a NF-kB 
promoter, increases in particular cytokine levels, increases in particular chemokine 
levels etc. The gene under the control of the NF-kB promoter may be a gene which 
naturally includes an NF-kB promoter or it may be a gene in a construct in which an 
NF-kB promoter has been inserted. Genes which naturally include the NF-kB 
promoter include but are not limited to IL-8, IL-12 p40, NF-kB-1uc, IL-12 p40-luc, and 
TNF-luc. Increases in cytokine levels may result from increased production or 
increased stability or increased secretion of the cytokines in response to the TLR- 
immunostimulatory compound interaction. Thl cytokines include but are not limited to 
IL-2, IFN-y, and IL-12. It has unexpectedly been discovered, according to the instant 
invention, that the promoter response element ISRE is directly activated as a result of 
signaling through the TLR3 signal transduction pathway, i.e., independent of DFN-y 
production. Th2 cytokines include but are not limited to BL-4, IL-5, and IL-10. 
Chemokines of particular significance in the invention ihclude but are not limited to 
CCL5 (RANTES), CXCL9 (Mig), CXCL10 (IP-10), and CXCL1 1 (I-TAC). 

In another aspect the invention provides a screening method for identifying a 
compound that modulates TLR3 signaling activity. The method according to this 
aspect of the invention involves the steps of (a) contacting a functional TLR3 with a 
test compound and a reference immunostimulatory compound under conditions which, 
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in presence of the reference immunostimulatory compound alone, permit a reference 
response mediated by a TLR3 signal transduction pathway; (b) detecting a test- 
reference response mediated by the TLR3 signal transduction pathway; (c) determining 
the test compound is an agonist of TLR3 signaling activity when the test-reference 
5 response exceeds the reference response; and (d) determining the test compound is an 
antagonist of TLR3 signaling activity when the reference response exceeds the test- 
reference response. A test-reference response refers to a type of test response as 
determined when a test compound and a reference immunostimulatory compound are 
simultaneously contacted with the TLR3. When a test compound is neither an agonist 

10 nor an antagonist of TLR3 signaling activity, the test-reference response and the 
reference response are indistinguishable. 

An agonist as used herein is a compound which causes an enhanced response of 
a TLR to a reference stimulus. The enhanced response can be additive or synergistic 
with respect to the response to the reference stimulus by itself. Furthermore, an agonist 

15 can work directly or indirectly to cause the enhanced response. Thus an agonist of 
TLR3 signaling activity as used herein is a compound which causes an enhanced 
response of a TLR to a reference stimulus. ■ 

An antagonist as used herein is a compound which causes a diminished 
response of a TLR to a reference stimulus. Furthermore, an antagonist can work 

20 directly or indirectly to cause the diminished response. Thus an antagonist of TLR3 
signaling activity as used herein is a compound which causes a diminished response of 
a TLR to a reference stimulus. 

In addition to identification and characterization of immunostimulatory 
compounds, agonists of TLR3 signaling, and antagonists of TLR3 signaling, the 

25 methods of the invention also permit optimization of lead compounds. Optimization of 
a lead compound involves an iterative application of a screening method of the 
invention, further including the steps of selecting the best candidate at any given stage 
or round in the screening and then substituting it as a benchmark or reference in a 
subsequent round of screening. This latter process can further include selection of 

30 parameters to modify in choosing and generating candidate test compounds to screen. 
For example, a lead compound from a particular round of screening can be used as a 
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basis to develop a focused library of new test compounds for use in a subsequent round 
of screening. 

In another aspect the invention provides a screening method for identifying 
species specificity of an immunostimulatory compound. The method according to this 

5 aspect of the invention involves the steps of (a) measuring a first species-specific 

response mediated by a TLR3 signal transduction pathway when a functional TLR3 of 
a first species is contacted with a test compound; (b) measuring a second species- 
specific response mediated by the TLR3 signal transduction pathway when a functional 
TLR3 of a second species is contacted with the test compound; and (c) comparing the 

10 first species-specific response with the second species-specific response. 

A species-specific TLR, including TLR3, is not limited to a human TLR, but 
rather can include a TLR derived from human or non-human sources. Examples of 
non-human sources include, but are not limited to, murine, rat, bovine, canine, feline, 
ovine, porcine, and equine. Other species include chicken and fish, e.g., aquaculture 

15 species. 

The species-specific TLR, including TLR3, also is not limited to native TLR 
polypeptides. In certain embodiments the TLR can be, e.g., a chimeric TLR in which 
the extracellular domain and the cytoplasmic domain are derived from TLR 
polypeptides from different species. Such chimeric TLR polypeptides, as described 

20 above, can include, for example, a human TLR extracellular domain and a murine TLR 
cytoplasmic domain, each domain derived from the corresponding TLR of each 
species. In alternative embodiments, such chimeric TLR polypeptides can include 
chimeras created with different TLR splice variants or allotypes. Other chimeric TLR 
polypeptides useful for the screening methods of the invention include chimeric 

25 polypeptides created with a TLR of a first type, e.g., TLR3, and another TLR, e.g., 
TLR7, TLR8, or TLR9, of the same or another species as the TLR of the first type. 
Also contemplated are chimeric polypeptides which incorporate sequences derived 
from more than two polypeptides, e.g., an extracellular domain, a transmembrane 
domain, and a cytoplasmic domain all derived from different polypeptide sources, 

30 provided at least one such domain derives from a TLR3 polypeptide. As a further 
example, also contemplated are constructs such as include an extracellular domain of 
one TLR3, an intracellular domain of another TLR3, and a non-TLR reporter such as 
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luciferase, GFP, etc. Those of skill in the art will recognize how to design and generate 
DNA sequences coding for such chimeric TLR polypeptides. 

It has also been discovered, according to the instant invention, that TLR-based 
screening assays, including but not limited to the TLR3-based assays described herein, 
5 are sensitive to parameters such as concentration of test compound, stability of test 
compound, kinetics of detection, and selection of reporter. These parameters can be 
optimized in order to derive the most information from a given screening assay. 
Importantly, the kinetics of detection appear to afford separation of types of 
information such as affinity of interaction and stability or duration of interaction. For 

10 example, measurements taken at earlier timepoints, e.g., after 6-8 hours of contact 
between TLR and test and/or reference compound, appear to reflect more information 
about affinity of interaction than do measurements obtained at later timepoints, e.g., 
after 16-24 or more hours of contact. In addition, while NF-KB-driven reporters are 
generally useful in TLR-based screening assays like those of the instant invention, in 

15 some instances a reporter other than an NF-KB-driven reporter will afford greater 

sensitivity. For example, the IL-8-luc reporter is significantly more sensitive to TLR7 
and TLR8 than NF-kB-1uc. Selection of reporter thus appears to be TLR-dependent, 
while parameters relating to kinetics and concentration appear to be more compound- 
dependent. Thus in performing the screening methods of the instant invention, it is 

20 expected that the methods will be enhance by inclusion of measurements obtained 
using at least two concentrations and two time points for each test compound. 
Typically at least three concentrations will be employed, spanning a two to three log- 
fold range of concentrations. Finer ranges of concentration can of course be employed 
under suitable circumstances, for instance based on results of an earlier screening 

25 performed using a wider initial range of concentrations. 

The invention will be more fully understood by reference to the following 
examples. These examples, however, are merely intended to illustrate certain 
embodiments of the invention and are not to be construed to limit the scope of the 
invention. 

30 

Examples 
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Example 1. Expression Vectors for Human TLR3 (hTLR3) and Murine TLR3 
(mTLR3) 

To create an expression vector for human TLR3, human TLR3 cDNA was 
amplified by the polymerase chain method (PCR) from a cDNA made from human 293 
5 cells using the primers 

S'-GAAACTCGAGCCACCATGAGACAGACTTTGCCTTGTATCTAC-S 1 (sense, 
SEQ ID NO:9) and 5 , -GAAAGAATTCTTAATGTACAGAGTTTTTGGATCCAAG-3 , 
(antisense, SEQ ID NO: 10). The primers introduce Xho I and EcoRI restriction 
endonuclease sites at their 5* ends for use in subsequent cloning into the expression 

JO vector. The resulting amplication product fragment was cloned into pGEM-T Easy 
vector (Promega), isolated, cut with Xho I and EcoRI restriction endonucleases, ligated 
into an Xho VEcoRI-digested pcDNA3.1 expression vector (Invitrogen), The insert 
was fully sequenced and translated into protein. The cDNA sequence corresponds to 
the published cDNA sequence for hTLR3, available as GenBank accession no. 

15 NM_003265 (SEQ ID NO: 1). The open reading frame codes for a protein 904 amino 
acids long, having the sequence corresponding to GenBank accession no. NP_003256 
(SEQIDNO:2). 

Table 2. cDNA Sequence for Human TLR3 
20 (GenBank Accession No. NM 003265; SEP ID NO:l) 

gcggccgcgt cgacgaaatg tctggatttg gactaaagaa aaaaggaaag gctagcagtc 60 
atccaacaga atcatgagac agactttgcc ttgtatctac ttttgggggg gccttttgcc 120 
ctttgggatg ctgtgtgcat cctccaccac caagtgcact gttagccatg aagttgctga 180 
ctgcagccac ctgaagttga ctcaggtacc cgatgatcta cccacaaaca taacagtgtt 24 0 

25 gaaccttacc cataatcaac tcagaagatt accagccgcc aacttcacaa ggtatagcca 300 
gctaactagc ttggatgtag gatttaacac catctcaaaa ctggagccag aattgtgcca 360 
gaaacttccc atgttaaaag ttttgaacct ccagcacaat gagctatctc aactttctga 420 
taaaaccttt gccttctgca cgaatttgac tgaactccat ctcatgtcca actcaatcca 480 
gaaaattaaa aataatccct ttgtcaagca gaagaattta atcacattag atctgtctca 540 

30 taatggcttg tcatctacaa aattaggaac tcaggttcag ctggaaaatc tccaagagct 600 
tctattatca aacaataaaa ttcaagcgct aaaaagtgaa gaactggata tctttgccaa 660 
ttcatcttta aaaaaattag agttgtcatc gaatcaaatt aaagagtttt ctccagggtg 720 
ttttcacgca attggaagat tatttggcct ctttctgaac aatgtccagc tgggtcccag 780 
ccttacagag aagctatgtt tggaattagc aaacacaagc attcggaatc tgtctctgag 840 

35 taacagccag ctgtccacca ccagcaatac aactttcttg ggactaaagt ggacaaatct 900 
cactatgctc gatctttcct acaacaactt aaatgtggtt ggtaacgatt cctttgcttg 960 
gcttccacaa ctagaatatt tcttcctaga gtataataat atacagcatt tgttttctca 1020 
ctctttgcac gggcttttca atgtgaggta cctgaatttg aaacggtctt ttactaaaca 1080 
aagtatttcc cttgcctcac tccccaagat tgatgatttt tcttttcagt ggctaaaatg 1140 

40 tttggagcac cttaacatgg aagataatga tattccaggc ataaaaagca atatgttcac 1200 
aggattgata aacctgaaat acttaagtct atccaactcc tttacaagtt tgcgaacttt 1260 
gacaaatgaa acatttgtat cacttgctca ttctccctta cacatactca acctaaccaa 1320 
gaataaaatc tcaaaaatag agagtgatgc tttctcttgg ttgggccacc tagaagtact 1380 
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tgacctgggc cttaatgaaa ttgggcaaga 

aaatattttc gaaatctatc tttcctacaa 

tgccttggtc ccaagccttc aacgactgat 

tagctctcct tcaccattcc agcctcttcg 

5 caacatagcc aacataaatg atgacatgtt 

tttgcagcat aacaacttag cacggctctg 

tttcctaaag ggtctgtctc acctccacat 

gatcccagtt gaggtcttca aggatttatt 

taatttaaac acacttccag catctgtctt 

JO ccttcagaag aatctcataa catccgttga 

cctgactgag ttagatatgc gctttaatcc 

gtttgttaat tggattaacg agacccatac 

ttgcaacact ccacctcact atcatgggtt 

caaagacagt gccccctttg aactcttttt 

15 tatctttatt gtacttctca tccactttga 

ttcagtacat cgagttcttg gtttcaaaga 

tgcagcatat ataattcatg cctataaaga 

aatggaaaag gaagaccaat ctctcaaatt 

tgtttttgaa ctagaagcaa ttgttaacag 

20 tataacacac catctattaa aagacccatt 

tcaacaagct attgaacaaa atctggattc 

agattataaa ctgaaccatg cactctgttt 

cttgaactgg ccagttcaga aagaacggat 

acttggatcc aaaaactctg tacattaaat 

25 actttctcaa tttaaaaagt tctatggcaa 

tgtttattca tatttgtaaa tgattatatt 

gtgtctcctt atttcaggcc tatttttgac 

tataagcacg caaaaaaaaa aaaaaaaaa 



actcacaggc caggaatgga gaggtctaga 144 0 

caagtacctg cagctgacta ggaactcctt 1500 

gctccgaagg gtggccctta aaaatgtgga 1560 

taacttgacc attctggatc taagcaacaa 162 0 

ggagggtctt gagaaactag aaattctcga 1680 

gaaacacgca aaccctggtg gtcccattta 174 0 

ccttaacttg gagtccaacg gctttgacga 1800 

tgaactaaag atcatcgatt taggattgaa 1860 

taataatcag gtgtctctaa agtcattgaa 1920 

gaagaaggtt ttcgggccag ctttcaggaa 1980 

ctttgattgc acgtgtgaaa gtattgcctg 2040 

caacatccct gagctgtcaa gccactacct 2100 

cccagtgaga ctttttgata catcatcttg 2160 

catgatcaat accagtatcc tgttgatttt 2220 

gggctggagg atatcttttt attggaatgt 2280 

aatagacaga cagacagaac agtttgaata 2340 

taaggattgg gtctgggaac atttctcttc 2400 

ttgtctggaa gaaagggact ttgaggcggg 2460 

catcaaaaga agcagaaaaa ttatttttgt 2520 

atgcaaaaga ttcaaggtac atcatgcagt 2580 

cattatattg gttttccttg aggagattcc 2640 

gcgaagagga atgtttaaat ctcactgcat 2700 

aggtgccttt cgtcataaat tgcaagtagc 2760 

ttatttaaat attcaattag caaaggagaa 2820 

atttaagttt tccataaagg tgttataatt 2 880 

ctatcacaat tacatctctt ctaggaaaat 2940 

aattgactta attttaccca aaataaaaca 3000 

3029 



30 Table 3. Amino Acid Sequence for Human TLR3 

(GenBank Accession No. NP 003256; SEO ID NO:2) 

MRQTLPCIYF WGGLLPFGML CASSTTKCTV SHEVADCSHL KLTQVPDDLP TNITVLNLTH 60 

NQLRRLPAAN FTRYSQLTSL DVGFNTISKL EPELCQKLPM LKVLNLQHNE LSQLSDKTFA 120 

FCTNLTELHL MSNSIQKIKN NPFVKQKNLI TLDLSHNGLS STKLGTQVQL ENLQELLLSN 180 

35 NKIQALKSEE LDIFANSSLK KLELSSNQIK EFSPGCFHAI GRLFGLFLNN VQLGPSLTEK 240 

LCLELANTSI RNLSLSNSQL STTSNTTFLG LKWTNLTMLD LSYNNLNWG NDSFAWLPQL 300 

EYFFLEYNNI QHLFSHSLHG LFNVRYLNLK RSFTKQSISL ASLPKIDDFS FQWLKCLEHL 360 

NMEDNDIPGI KSNMFTGLIN LKYLSLSNSF TSLRTLTNET FVSLAHSPLH ILNLTKNKIS 420 

KIESDAFSWL GHLEVLDLGL NEIGQELTGQ EWRGLENIFE IYLSYNKYLQ LTRNSFALVP 4 80 

40 SLQRLMLRRV ALKNVDSSPS PFQPLRNLTI LDLSNNNIAN INDDMLEGLE KLEILDLQHN 540 

NLARLWKHAN PGGPIYFLKG LSHLHILNLE SNGFDEIPVE VFKDLFELKI IDLGLNNLNT 600 

LPASVFNNQV SLKSLNLQKN LITSVEKKVF GPAFRNLTEL DMRFNPFDCT CESIAWFVNW 660 

INETHTNIPE LSSHYLCNTP PHYHGFPVRL FDTSSCKDSA PFELFFMINT SILLIFIFIV 720 

LLIHFEGWRI SFYWNVSVHR VLGFKEIDRQ TEQFEYAAYI IHAYKDKDWV WEHFSSMEKE 7 80 

45 DQSLKFCLEE RDFEAGVFEL EAIVNSIKRS RKIIFVITHH LLKDPLCKRF KVHHAVQQAI 840 

EQNLDSIILV FLEEIPDYKL NHALCLRRGM FKSHCILNWP VQKERIGAFR HKLQVALGSK 900 

NSVH 904 



Corresponding nucleotide and amino acid sequences for murine TLR3 
50 (mTLR3) are known. The nucleotide sequence of mTLR3 cDNA has been reported as 
GenBank accession no. AF355152, and the amino acid sequence of mTLR3 has been 
reported as GenBank accession no. AAK261 17. 
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Table 4. cDNA Sequence for Murine TLR3 

(GenBank Accession No. AF355152: SEP ID NO:3) 

tagaatatga tacagggatt gcacccataa tctgggctga atcatgaaag ggtgttcctc 60 
5 ttatctaatg tactcctttg ggggactttt gtccctatgg attcttctgg tgtcttccac 120 
aaaccaatgc actgtgagat acaacgtagc tgactgcagc patttgaagc taacacacat 180 
acctgatgat cttccctcta acataacagt gttgaatctt actcacaacc aactcagaag 24 0 
attaccacct accaacttta caagatacag ccaacttgct atcttggatg caggatttaa 300 
ctccatttca aaactggagc cagaactgtg ccaaatactc cctttgttga aagtattgaa 360 

10 cctgcaacat aatgagctct ctcagatttc tgatcaaacc tttgtcttct gcacgaacct 420 
gacagaactc gatctaatgt ctaactcaat acacaaaatt aaaagcaacc ctttcaaaaa 4 80 
ccagaagaat ctaatcaaat tagatttgtc tcataatggt ttatcatcta caaagttggg 540 
aacgggggtc caactggaga acctccaaga actgctctta gcaaaaaata aaatccttgc 600 
gttgcgaagt gaagaacttg agtttcttgg caattcttct ttacgaaagt tggacttgtc 660 

15 atcaaatcca cttaaagagt tctccccggg gtgtttccag acaattggca agttattcgc 720 
cctcctcttg aacaacgccc aactgaaccc ccacctcaca gagaagcttt gctgggaact 780 
ttcaaacaca agcatccaga atctctctct ggctaacaac cagctgctgg ccaccagcga 84 0 
gagcactttc tctgggctga agtggacaaa tctcacccag ctcgatcttt cctacaacaa 900 
cctccatgat gtcggcaacg gttccttctc ctatctccca agcctgaggt atctgtctct 960 

20 ggagtacaac aatatacagc gtctgtcccc tcgctctttt tatggactct ccaacctgag 1020 
gtacctgagt ttgaagcgag catttactaa gcaaagtgtt tcacttgctt cacatcccaa 1080 
cattgacgat ttttcctttc aatggttaaa atatttggaa tatctcaaca tggatgacaa 1140 
taatattcca agtaccaaaa gcaatacctt cacgggattg gtgagtctga agtacctaag 12 00 
tctttccaaa actttcacaa gtttgcaaac tttaacaaat gaaacatttg tgtcacttgc 1260 

25 tcattctccc ttgctcactc tcaacttaac gaaaaatcac atctcaaaaa tagcaaatgg 1320 
tactttctct tggttaggcc aactcaggat acttgatctc ggccttaatg aaattgaaca 1380 
aaaactcagc ggccaggaat ggagaggtct gagaaatata tttgagatct acctatccta 144 0 
taacaaatac ctccaactgt ctaccagttc ctttgcattg gtccccagcc ttcaaagact 1500 
gatgctcagg agggtggccc ttaaaaatgt ggatatctcc ccttcacctt tccgccctct 1560 

30 tcgtaacttg accattctgg acttaagcaa caacaacata gccaacataa atgaggactt 1620 
gctggagggt cttgagaatc tagaaatcct ggattttcag cacaataact tagccaggct 1680 
ctggaaacgc gcaaaccccg gtggtcccgt taatttcctg aaggggctgt ctcacctcca 1740 
catcttgaat ttagagtcca acggcttaga tgaaatccca gtcggggttt tcaagaactt 1800 
attcgaacta aagagcatca atctaggact gaataactta aacaaacttg aaccattcat 1860 

35 ttttgatgac cagacatctc taaggtcact gaacctccag aagaacctca taacatctgt 1920 
tgagaaggat gttttcgggc cgccttttca aaacctgaac agtttagata tgcgcttcaa 1980 
tccgttcgac tgcacgtgtg aaagtatttc ctggtttgtt aactggatca accagaccca 2040 
cactaatatc tttgagctgt ccactcacta cctctgtaac actccacatc attattatgg 2100 
cttccccctg aagcttttcg atacatcatc ctgtaaagac agcgccccct ttgaactcct 2160 

40 cttcataatc agcaccagta tgctcctggt ttttatactt gtggtactgc tcattcacat 2220 
cgagggctgg aggatctctt tttactggaa tgtttcagtg catcggattc ttggtttcaa 22 80 
ggaaatagac acacaggctg agcagtttga atatacagcc tacataattc atgcccataa 234 0 
agacagagac tgggtctggg aacatttctc cccaatggaa gaacaagacc aatctctcaa 2400 
attttgccta gaagaaaggg actttgaagc aggcgtcctt ggacttgaag caattgttaa 24 60 

45 tagcatcaaa agaagccgaa aaatcatttt cgttatcaca caccatttat taaaagaccc 2520 
tctgtgcaga agattcaagg tacatcacgc agttcagcaa gctattgagc aaaatctgga 2580 
ttcaattata ctgatttttc tccagaatat tccagattat aaactaaacc atgcactctg 2640 
tttgcgaaga ggaatgttta aatctcattg catcttgaac tggccagttc agaaagaacg 2700 
gataaatgcc tttcatcata aattgcaagt agcacttgga tctcggaatt cagcacatta 2760 

50 aactcatttg aagatttgga gtcggtaaag ggatagatcc aatttataaa ggtccatcat 2 820 
gaatctaagt tttacttgaa agttttgtat atttatttat atgtatagat gatgatatta 2 880 
catcacaatc caatctcagt tttgaaatat ttcggcttat ttcattgaca tctggtttat 2 94 0 
tcactccaaa taaacacatg ggcagttaaa aacatcctct attaatagat tacccattaa 3000 
ttcttgaggt gtatcacagc tttaaagggt tttaaatatt tttatataaa taagactgag 3 060 

55 agttttataa atgtaatttt ttaaaactcg agtcttactg tgtagctcag aaaggcctgg 3120 
aaattaatat attagagagt catgtcttga acttatttat ctctgcctcc ctctgtctcc 3180 
agagtgttgc ttttaagggc atgtagcacc acacccagct atgtacgtgt gggattttat 3240 

! 
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aatgctcatt tttgagacgt ttatagaata aaagataatt gcttttatgg tataaggcta 3300 
cttgaggtaa 3310 

Table 5. Amino Acid Sequence for Murine TLR3 
5 (GenBank Accession No. AAK261 17: SEP ID NO:4) 

MKGCSSYLMY SFGGLLSLWI LLVSSTNQCT VRYNVADCSH LKLTHIPDDL PSNITVLNLT 60 

HNQLRRLPPT NFTRYSQLAI LDAGFNSISK LEPELCQILP LLKVLNLQHN ELSQISDQTF 120 

VFCTNLTELD LMSNSIHKIK SNPFKNQKNL IKLDLSHNGL SSTKLGTGVQ LENLQELLLA 180 

KNKILALRSE ELEFLGNSSL RKLDLSSNPL KEFSPGCFQT IGKLFALLLN NAQLNPHLTE 24 0 

JO KLCWELSNTS IQNLSLANNQ LLATSESTFS GLKWTNLTQL DLSYNNLHDV GNGSFSYLPS 300 

LRYLSLEYKN IQRLSPRSFY GLSNLRYLSL KRAFTKQSVS LASHPNIDDF SFQWLKYLEY 360 

LNMDDNNIPS TKSNTFTGLV SLKYLSLSKT FTSLQTLTNE TFVSLAHSPL LTLNLTKNHI 420 

SKIANGTFSW LGQLRILDLG LNEIEQKLSG QEWRGLRNIF EIYLSYNKYL QLSTSSFALV 4 80 

PSLQRLMLRR VALKNVDISP SPFRPLRNLT ILDLSNNNIA NINEDLLEGL ENLEILDFQH 540 

15 NNLARLWKRA NPGGPVNFLK GLSHLHILNL ESNGLDEIPV GVFKNLFELK SINLGLNNLN 600 

KLEPFIFDDQ TSLRSLNLQK NLITSVEKDV FGPPFQNLNS LDMRFNPFDC TCESISWFVN 660 

WINQTHTNIF ELSTHYLCNT PHHYYGFPLK LFDTSSCKDS APFELLFIIS TSMLLVFILV 720 

VLLIHIEGWR ISFYWNVSVH RILGFKEIDT QAEQFEYTAY IIHAKKDRDW VWEHFSPMEE 780 

QDQSLKFCLE ERDFEAGVLG LEAIVNSIKR SRKIIFVITH HLLKDPLCRR FKVHHAVQQA 840 

20 IEQNLDSIIL IFLQNIPDYK LNHALCLRRG MFKSHCILNW PVQKERINAF HHKLQVALGS 900 

RNSAH 905 

Example 2. Method of Making IFN-ct4 Reporter Vector 

A number of reporter vectors may be used in the practice of the invention. 

25 Some of the reporter vectors are commercially available, e.g., the luciferase reporter 
vectors pNF-KB-Luc (Stratagene) and pAPl-Luc (Stratagene). These two reporter 
vectors place the luciferase gene under control of an upstream (5') promoter region 
derived from genomic DNA for NF-kB or API, respectively. Other reporter vectors 
can be constructed following standard methods using the desired promoter and a vector 

30 containing a suitable reporter, such as luciferase, P-galactosidase (p-gal), 

chloramphenicol acetyltransferase (CAT), and other reporters known by those skilled in 
the art. Following are some examples of reporter vector^ constructed for use in the 
present invention. ' 

IFN-a4 is an immediate-early type 1 IFN. Sequence-specific PCR products for 

35 the -620 to +50 promoter region of IFN-a4 were derived from genomic DNA of 

human 293 cells and cloned into Smal site of the pGL3-Basic Vector (Promega). The 
resulting expression vector includes a luciferase gene under control of an upstream (5 f ) 
-620 to +50 promoter region of EFN-a4. The sequence of the -620 to +50 promoter 
region of IFN-a4 is provided as SEQ ID NO:l 1 in Table 6. 

40 
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Table 6. Nucleotide Sequence of the -620 to +50 Promoter Region of Human IFN-a4 

(SEP ID NO: in . . 

agaaaaattt taaaaaatta ttcattcata tttttaggag ttttgaatga ttggatatgt 60 
aattatattc atattattaa tgtgtatcta tatagatttt tattttgcat atgtactttg 120 

5 atacaaaatt tacatgaaca aattacacta aaagttattc cacaaatata cttatcaaat 180 
taagttaaat gtcaatagct tttaaactta aattttagtt taacttttct gtcattcttt 240 
actttgaata aaaagagcaa actttgtagt ttttatctgt gaagtagagg tatacgtaat 300 
atacataaat agatatgcca aatctgtgtt attaaaattt catgaagatt tcaattagaa 360 
aaaaatacca taaaaggctt tgagtgcagg tgaaaaatag gcaatgatga aaaaaaatga 420 

10 aaaacttttt aaacacatgt agagagtgcg taaagaaagc aaaaacagag atagaaagta 480 
caactaggga atttagaaaa tggaaattag tatgttcact atttaagacc tatgcacaga 540 
gcaaagtctt cagaaaacct agaggccgaa gttcaaggtt atccatctca agtagcctag 600 
caatatttgc aacatcccaa tggccctgtc cttttcttta ctgatggccg tgctggtgct 660 

670 

cagctacaaa 

15 

Example 3. Method of Making IFN-ccl Reporter Vector 

IFN-ccl is a late type 1 IFN. Sequence-specific PCR products for the -140 to 
+9 promoter region of IFN-ctl were derived from genomic DNA of human 293 cells 
and cloned into Smal site of the pGL3-Basic Vector (Promega). The resulting 
20 expression vector includes a luciferase gene under control of an upstream (5") -140 to 
+9 promoter region of IFN-al . 

Example 4. Method of Making IFN-P Reporter Vector 

IFN-p is an immediate-early type 1 IFN. The -280 to +20 promoter region of 
25 IFN-p was derived from the P UCp26 vector (Algarte M et al. (1999) J Virol 

73(4):2694-702) by restriction at EcoRl and Taql sites. The 300 bp restriction fragment 
was filled in by Klenow enzyme and cloned into Mel-digested and filled in pGL3- 
Basic Vector (Promega). The resulting expression vector includes a luciferase gene 
under control of an upstream (5') -280 to +20 promoter region of IFN-p. The sequence 
30 of the -280 to +20 promoter region of IFN-P is provided as SEQ ID NO:12 in Table 7. 

Table 7. Nucleotide Sequence of the -280 to +20 Promoter Region of Human IFN-p 

(SEP ID NO: 12) 

ttctcaggtc gtttgctttc ctttgctttc tcccaagtct tgttttacaa tttgctttag 60 
35 tcattcactg aaactttaaa aaacattaga aaacctcaca gtttgtaaat ctttttccct 120 
attatatata tcataagata ggagcttaaa taaagagttt tagaaactac taaaatgtaa 180 
atgacatagg aaaactgaaa gggagaagtg aaagtgggaa attcctctga atagagagag 240 
gaccatctca tataaatagg ccatacccac ggagaaagga cattctaact gcaacctttc 300 
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Example 5. Method of Making RANTES Reporter Vector 

Transcription of the chemokine RANTES is believed to be regulated at least in 
part by IRF3 and by NF-kB. Lin R et al. (1999) JMol Cell Biol 19(2):959-66; Genin P 
et al. (2000) J Immunol 164:5352-61. A 483 bp sequence-specific PCR product 

5 including the -397 to +5 promoter region of RANTES was derived from genomic DNA 
of human 293 cells, restricted with PstI and cloned into pCAT-Basic Vector (Promega) 
using Hindni (filled in with Klenow) and PstI sites (filled in). The -397 to +5 
promoter region of RANTES was then isolated from the resulting 
RANTES/chloramphenicol acetyltransferase (CAT) reporter plasmid by restriction with 

W Bgin and Sail, filled in with Klenow enzyme, and cloned into the Nhel site (filled in 
with Klenow) of the pGL3-Basic Vector (Promega). The resulting expression vector 
includes a luciferase gene under control of an upstream (5') -397 to +5 promoter region 
of RANTES. Comparison of the insert sequence -397 to +5 of Genin P et al (2000) J 
Immunol 164:5352-61 and GenBank accession no. AB023652 (SEQ ID NO:13) 

/ 5 revealed two point deletions (at positions 105 and 273 of SEQ ID NO: 13) which do not 
create new restriction sites. The sequence of the -397 to +5 promoter region of 
RANTES is provided as SEQ ID NO: 14 in Table 8. 

Table 8. Nucleotide Sequence of the -397 to +5 Promoter Region of Human RANTES 
20 (SEP ID NO: 14) r — 

60 
120 
180 
240 
300 
360 
401 



gatctgtaat gaataagcag gaactttgaa gactcagtga cjtcagtgagt aataaagact 
cagtgacttc tgatcctgtc ctaactgcca ctccttgttg tcccaagaaa gcggcttcct 
gctctctgag gaggacccct tccctggaag gtaaaactaa ggatgtcagc agagaaattt 
ttccaccatt ggtgcttggt caaagaggaa actgatgagc tcactctaga tgagagagca 
25 gtgagggaga gacagagact cgaatttccg gagctatttc agttttcttt tccgttttgt 
gcaatttcac ttatgatacc ggccaatgct tggttgctat tttggaaact ccccttaggg 
gatgcccctc aactggccct ataaagggcc agcctgagct g 



30 



Table 9. Nucleotide Sequence of GenBank Access i on No. AB023652 (SEQ ID NO:13) 



agaaggcctt acagtgagat gggatcccag tatttattga gtttcctcat tcataaaatg 
gggataataa tagtaaatga gttgacacgc gctaagacag tggaatagtg gctggcacag 
ataagccctc ggtaaatggt agccaataat gatagagtat gctgtaagat atctttctct 
ccctctgctt ctcaacaagt ctctaatcaa ttattccact ttataaacaa ggaaatagaa 
ctcaaagaca ttaagcactt ttcccaaagg tcgcttagca agtaaatggg agagacccta 
35 tgaccaggat gaaagcaaga aattcccaca agaggactca ttccaactca tatcttgtga 
aaaggttccc aatgcccagc tcagatcaac tgcctcaatt tacagtgtga gtgtgctcac 
ctcctttggg gactgtatat ccagaggacc ctcctcaata aaacacttta taaataacat 
ccttccatgg atgagggaaa ggaggtaaga tctgtaatga ataagcagga actttgaaga 
ctcagtgact cagtgagtaa taaagactca gtgacttctg atcctgtcct aactgccact 
40 ccttgttgtc cccaagaaag cggcttcctg ctctctgagg aggacccctt ccctggaagg 
taaaactaag gatgtcagca gagaaatttt tccaccattg gtgcttggtc aaagaggaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
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ctgatgagct cactctagat gagagagcag tgagggagag acagagactc gaatttccgg 780 
aggctatttc agttttcttt tccgttttgt gcaatttcac ttatgatacc ggccaatgct 840 
tggtVgctat tttggaaact ccccttaggg gatgcccctc aactggccct ataaagggcc 900 
agcctgagct gcagaggatt cctgcagagg atcaagacag cacgtggacc tcgcacagcc 960 
5 tctcccacag gtaccatgaa ggtctccgcg gcagccctcg ctgtcatcct cattgctact 1020 
gccctctgcg c 1031 

Example 6. Method of Making Human IL-12 p40 Reporter Vectors 

Reporter constructs have been made using truncated (-250 to +30) and full 
10 length (-860 to +30) promoter regions derived from human IL-12 p40 genomic DNA. 
In one reporter construct the truncated IL-12 p40 promoter was cloned as a KpnI-XhoI 
insert into ppgal-Basic (Promega). The resulting expression vector includes a p gal 
gene under control of an upstream (5') -250 to +30 promoter region of human IL-12 
p40. In a second reporter construct the full length DL-12 p40 promoter was cloned as a 
15 KpnI-XhoI insert into ppgal-Basic (Promega). The resulting expression vector includes 
a P gal gene under control of an upstream (5') -860 to +30 promoter region of human 
IL-12 p40. In a third reporter construct the truncated -250 to +30 promoter region of 
human IL-12 p40 was cloned into the pGL3-Basic Vector (Promega). The resulting 
expression vector includes a luciferase gene under control of an upstream (5') -250 to 
20 +30 promoter region of human IL-12 p40. In a fourth reporter construct the full length 
IL-12 p40 promoter of human IL-12 p40 was cloned into the pGL3-Basic Vector 
(Promega). The resulting expression vector includes a luciferase gene under control of 
an upstream (5 1 ) -860 to +30 promoter region of human IL-12 p40. 

25 Example 7. Method of Making Human IL-6 Reporter Vectors 

Reporter constructs are made using the -235 to +7 promoter region derived from 
human IL-6 genomic DNA. In one reporter construct the EL-6 promoter region is 
cloned as a KpnI-XhoI insert into pGL3-Basic Vector (Promega). The resulting 
expression vector includes a luciferase gene under control of an upstream (5 1 ) -235 to 
30 +7 promoter region derived from human IL-6 genomic DNA. 

Example 8. Method of Making Human IL-8 Reporter Vectors 

Reporter constructs have been made using a -546 to +44 and a truncated -133 to 
+44 promoter region derived from human IL-8 genomic DNA. Mukaida N et al. 
35 (1989) J Immunol 143:1366-71. In each reporter construct the IL-8 promoter region 
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was cloned as a KpnI-XhoI insert into pGL3-Basic Vector (Promega). One of the 
resulting expression vectors includes a luciferase gene junder control of an upstream (5 1 ) 
-546 to +44 promoter region derived from human IL-8 genomic DNA. Another of the 
resulting expression vectors includes a luciferase gene under control of an upstream (5 f ) 
5 -1 33 to +44 promoter region derived from human IL-8 genomic DNA. 

Example 9. Sequence Comparison of Human TLR3 and Human TLR9 

Human TLR3 and TLR9 are homologous proteins with several structural 
commonalities. Both appear to be transmembrane proteins with an extracellular 

10 domain and an intracellular domain. Common characteristics include a signal sequence 
and transmembranal domain. Similarities common to most TLRs include a cysteine 
rich domain and a TIR domain. Most TLRs have leucine rich repeats (LRR) in their 
extracellular domain. TLR3, TLR7, TLR8, and TLR9 appear to have similar 
structures. The regularity of the leucine repeats are shown below for TLR3 and TLR9. 

15 These four TLRs can be broken into two extracellular subdomains, domain 1 and 2, by 
virtue of a separation by an unstructured hinge region. [TLR7, TLR8, and TLR9 have 
14 LRR in domain 1 and 12 LRR in domain 2. TLR9 is a known nucleic acid binder, 
interacting with CpG-DNA. It has been suspected that TLR7 and TLR8 most likely 
also interact with nucleic acids. TLR3 has a similar 1 1 LRR in domain 1 and has 12 

20 LRR in domain 2, lacking the initial 3 repeats common to TLR7, TLR8, and TLR9. 
Based on structural consideration it is hypothesized that TLR3 interacts with nucleic 
acids or similar structures. 

The structure of TLR3 differs from TLR7, TLR8, and TLR9 in an interesting 
character. Referring to Table 13, within the TIR domain it has been shown that a 

25 proline (shown in bold) is required for MyD88 interaction. MyD88 is required for 

TLR9 to transduce signal for the activation of NF-kB. Both TLR7 and TLR8 also have 
this proline. TLR3 however has an alanine at this position (also shown in bold). It is 
believed by the applicant that this difference may disallow MyD88 interaction with 
TLR3 and thus result in an altered signal transduction pattern compared to, e.g., TLR9. 

30 

Table 10. Sequence Alignment of hTLR9 (SEP ID NO:6) and hTLR3 (SEP ID NO:2) 

Signal Sequence 
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hTLR9 MGFCRSALHPLSLLVQAIMLAMTLALGTLPAFLPCELQPHGLVNCNW 
hTLR3 MRQTLPCIYFWGGLLPFGMLCASSTTKCTVSHEVADC 



47 
37 



5 hTLR9 



Domain 1 Leucine Rich Repeats 
lflks vphfsmaaprgnvtslsls sn 



73 



hTLR9 



RIHHLHDSDFAHLPSLRHLNLKWN 



97 



JO 



hTLR9 



CPPVGLSPMHFPCHMTIEPSTFLAVPTLEELNLSYN 



133 



15 



hTLR9 
hTLR3 



NIMTVPALPKSLISLSLSHT 
SHLKLTQVPDDLPTNITVLNLTHN 



153 
61 



hTLR9 
hTLR3 



NILMLDSASLAGLHALRFLFMDGN 
QLRRLPAANFTRYSQLTSLDVGFN 



177 
85 



20 



25 
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hTLR9 CYYKNPCRQALEVAPGALLGLGNLTHLSLKYN 

hTLR3 TISKLEPELCQKLPMLKVLNLQHN 

hTLR9 NLTWPRNLPSSLEYLLLSYN 

hTLR3 ELSQLSDKTFAFCTNLTELHLMSN 

hTLR9 RIVKLAPEDLANLTALRVLDVGGN 

hTLR3 SIQKIKNNPFVKQKNLITLDLSHN 

hTLR9 CRRCDHAPNPCMECPRHFPQ LUPDTFSHL SRLEGLVLKDS 

hTLR3 GLSSTKLGTQVQLENLQELLLSNN 



209 
109 

230 
133 

254 
157 

294 
181 
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40 



45 



50 



55 



hTLR9 
hTLR3 

hTLR9 
hTLR3 

hTLR9 
hTLR3 

hTLR9 
hTLR3 

hTLR9 
hTLR3 



hTLR9 
hTLR3 



hTLR9 
hTLR3 

hTLR9 
hTLR3 



SLSWLNASWFRGLGNLRVLDLSEN 
KIQALKSEELDIFANSSLKKLELSSN 

FLYKCITKTKAFQGLTQLRKLNLSFN 

QIKEFSPGCFHAIGRLFGLFLNNV 

i 

' I 

YQKRVSFAHLSLAPSFGSLVALKELDMHGI 
QLGPSLTEKLCLELANTS IRNLSLSNS 

FFRSLDETTLRPLARLPMLQTLRLQMN 
QLSTTSNTTFLGLKWTNLTMLDLSYN 

FINQAQLGIFRAFPGLRYVDLSDN 
NLNWGNDSFAWLPQLEYFFLEYN 

Hinge Region 

r i s gas eltatmgeadgge kvwlqpgdl apap v 
niqhlfshslhglfnvrylnlkrsftkqsisla 

Domain 2 Leucine Rich Repeats 

DTPSSEDFRPNCSTLNFTLDLSRN 
SLPKIDDFSFQWLKCLEHLNMEDN 

NLVTVQPEMFAQLSHLQCLRLSHN 
DIPGIKSNMFTGLINLKYLSLSNS 



318 
207 

344 
231 

374 
258 

401 
284 

425 
308 



458 
341 



482 
365 

506 
389 
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Example 10. Reconstitution of TLR9 Signaling in 2^3 Fibroblasts 

Methods for cloning murine and human TLR9 have been described in pending 
U.S. Patent Application No. 09/954,987 and corresponding published PCT application 
5 PCT/US01/29229, both filed September 17, 2001, the contents of which are 

incorporated by reference. Human TLR9 cDNA and murine TLR9 cDNA in pT-Adv 
vector (from Clonetech) were individually cloned into the expression vector 
pcDNA3.1(-) from Invitrogen using the EcoRI site. Utilizing a "gain of function" 
assay it was possible to reconstitute human TLR9 (hTLR9) and murine TLR9 (mTLR9) 
JO signaling in CpG-DNA non-responsive human 293 fibroblasts (ATCC, CRL-1573). 
The expression vectors mentioned above were transfected into 293 fibroblast cells 
using the calcium phosphate method. 

Table 1 1 . cDNA Sequence for Human TLR9 
15 (GenBank Accession No. AF245704: SEP ID NO:5) 

aggctggtat aaaaatctta cttcctctat tctctgagcc gctgctgccc 
ggacctcgag tgtgaagcat ccttccctgt agctgctgtc qagtctgccc 
ctggagaagc ccctgccccc cagcatgggt ttctgccgca gcgccctgca 
ctcctggtgc aggccatcat gctggccatg accctggccc tgggtacctt 

20 ctaccctgtg agctccagcc ccacggcctg gtgaactgca actggctgtt 
gtgccccact tctccatggc agcaccccgt ggcaatgtca ccagcctttc 
aaccgcatcc accacctcca tgattctgac tttgcccacc tgcccagcct 
aacctcaagt ggaactgccc gccggttggc ctcagcccca tgcacttccc 
accatcgagc ccagcacctt cttggctgtg cccaccctgg aagagctaaa 

25 aacaacatca tgactgtgcc tgcgctgccc aaatccctca tatccctgtc 
accaacatcc tgatgctaga ctctgccagc ctcgccggcc tgcatgccct 
ttcatggacg gcaactgtta ttacaagaac ccctgcaggc aggcactgga 
ggtgccctcc ttggcctggg caacctcacc cacctgtcac tcaagtacaa 
gtggtgcccc gcaacctgcc ttccagcctg gagtatctgc tgttgtccta 

30 gtcaaactgg cgcctgagga cctggccaat ctgaccgccc tgcgtgtgct 
ggaaattgcc gccgctgcga ccacgctccc aacccctgca tggagtgccc 
ccccagctac atcccgatac cttcagccac ctgagccgtc ttgaaggcct 
gacagttctc tctcctggct gaatgccagt tggttccgtg ggctgggaaa 
ctggacctga gtgagaactt cctctacaaa tgcatcacta aaaccaaggc 

35 ctaacacagc tgcgcaagct taacctgtcc ttcaattacc aaaagagggt 
cacctgtctc tggccccttc cttcgggagc ctggtcgccc tgaaggagct 
ggcatcttct tccgctcact cgatgagacc acgctccggc cactggcccg 
ctccagactc tgcgtctgca gatgaacttc atcaaccagg cccagctcgg 
gccttccctg gcctgcgcta cgtggacctg tcggacaacc gcatcagcgg 

40 ctgacagcca ccatggggga ggcagatgga ggggagaagg tctggctgca 
cttgctccgg ccccagtgga cactcccagc tctgaagact tcaggcccaa 
ctcaacttca ccttggatct gtcacggaac aacctggtga ccgtgcagcc 
gcccagctct cgcacctgca gtgcctgcgc ctgagccaca ajctgcatctc 
aatggctccc agttcctgcc gctgaccggt ctgcaggtgc fcagacctgtc 

45 ctggacctct accacgagca ctcattcacg gagctaccgc gactggaggc 
agctacaaca gccagccctt tggcatgcag ggcgtgggcc acaacttcag 
cacctgcgca ccctgcgcca cctcagcctg gcccacaaca acatccacag 



ctgtgggaag 60 
gccagaccct 12 0 
cccgctgtct 180 
gcctgccttc 240 
cctgaagtct 300 
cttgtcctcc 360 
gcggcatctc 420 
ctgccacatg 4 80 
cctgagctac 54 0 
cctcagccat 600 
gcgcttccta 660 
ggtggccccg 720 
caacctcact 780 
caaccgcatc 640 
cgatgtgggc 900 
tcgtcacttc 960 
ggtgttgaag 1020 
cctccgagtg 1080 
cttccagggc 114 0 
gtcctttgcc 1200 
ggacatgcac 1260 
cctgcccatg 1320 
catcttcagg 1380 
agcttcggag 144 0 
gcctggggac 1500 
ctgcagcacc 1560 
ggagatgttt 1620 
gcaggcagtc 1680 
ccgcaataag 1740 
cctggacctc 1800 
cttcgtggct 1860 
ccaagtgtcc 1920 
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cagcagctct gcagtacgtc gctgcgggcc ctggacttca gcggcaatgc actgggccat 1980 
atgtgggccg agggagacct ctatctgcac ttcttccaag gcctgagcgg tttgatctgg 2 040 
ctggacttgt cccagaaccg cctgcacacc ctcctgcccc aaaccctgcg caacctcccc 2100 
aagagcctac aggtgctgcg tctccgtgac aattacctgg ccttctttaa gtggtggagc 2160 
5 ctccacttcc tgcccaaact ggaagtcctc gacctggcag gaaaccggct gaaggccctg 2220 
accaatggca gcctgcctgc tggcacccgg ctccggaggc tggatgtcag ctgcaacagc 2280 
atcagcttcg tggcccccgg cttcttttcc aaggccaagg agctgcgaga gctcaacctt 2340 
agcgccaacg ccctcaagac agtggaccac tcctggtttg ggcccctggc gagtgccctg 2400 
caaatactag atgtaagcgc caaccctctg cactgcgcct gtggggcggc ctttatggac 2460 

10 ttcctgctgg aggtgcaggc tgccgtgccc ggtctgccca gccgggtgaa gtgtggcagt 2520 
ccgggccagc tccagggcct cagcatcttt gcacaggacc tgcgcctctg cctggatgag 2580 
gccctctcct gggactgttt cgccctctcg ctgctggctg tggctctggg cctgggtgtg 2640 
cccatgctgc atcacctctg tggctgggac ctctggtact gcttccacct gtgcctggcc 2700 
tggcttccct ggcgggggcg gcaaagtggg cgagatgagg atgccctgcc ctacgatgcc 2760 

15 ttcgtggtct tcgacaaaac gcagagcgca gtggcagact gggtgtacaa cgagcttcgg 2820 
gggcagctgg aggagtgccg tgggcgctgg gcactccgcc tgtgcctgga ggaacgcgac 2880 
tggctgcctg gcaaaaccct ctttgagaac ctgtgggcct cggtctatgg cagccgcaag 2940 
acgctgtttg tgctggccca cacggaccgg gtcagtggtc tcttgcgcgc cagcttcctg 3000 
ctggcccagc agcgcctgct ggaggaccgc aaggacgtcg tggtgctggt gatcctgagc 3060 

20 cctgacggcc gccgctcccg ctacgtgcgg ctgcgccagc gcctctgccg ccagagtgtc 3120 
ctcctctggc cccaccagcc cagtggtcag cgcagcttct gggcccagct gggcatggcc 3180 
ctgaccaggg acaaccacca cttctataac cggaacttct gccagggacc cacggccgaa 3240 
tagccgtgag ccggaatcct gcacggtgcc acctccacac tcacctcacc tctgcctgcc 3300 
tggtctgacc ctcccctgct cgcctccctc accccacacc tgacacagag ca 3352 



25 



Table 12. Amino Acid Sequence for Human TLR9 
(GenBank Accession No. AAF78037: SEP ID NQ:6) 



MGFCRSALHP LSLLVQAIML AMTLALGTLP AFLPCELQPH GLVNCNWLFL KSVPHFSMAA 60 

PRGNVTSLSL SSNRIHHLHD SDFAHLPSLR HLNLKWNCPP VGLSPMHFPC HMTIEPSTFL 120 

30 AVPTLEELNL SYNNIMTVPA LPKSLISLSL SHTNILMLDS ASLAGLHALR FLFMDGNCYY 180 

KNPCRQALEV APGALLGLGN LTHLSLKYNN LTWPRNLPS SLEYLLLSYN RIVKLAPEDL 240 

ANLTALRVLD VGGNCRRCDH APNPCMECPR HFPQLHPDTF SHLSRLEGLV LKDSSLSWLN 300 

. ASWFRGLGNL RVLDLSENFL YKCITKTKAF QGLTQLRKLN LSFNYQKRVS FAHLSLAPSF 360 

GSLVALKELD MHGIFFRSLD ETTLRPLARL PMLQTLRLQM NFINQAQLGI FRAFPGLRYV 42 0 

35 DLSDNRISGA SELTATMGEA DGGEKVWLQP GDLAPAPVDT ^SSEDFRPNC STLNFTLDLS 480 

RNNLVTVQPE MFAQLSHLQC LRLSHNCISQ AVNGSQFLPL tGLQVLDIiSR NKLDLYHEHS 54 0 

FTELPRLEAL DLSYNSQPFG MQGVGHNFSF VAHLRTLRHL SLAHNNIHSQ VSQQLCSTSL 600 

RALDFSGNAL GHMWAEGDLY LHFFQGLSGL IWLDLSQNRL HTLLPQTLRN LPKSLQVLRL 660 

RDNYLAFFKW WSLHFLPKLE VLDLAGNRLK ALTNGSLPAG TRLRRLDVSC NSISFVAPGF 720 

40 FSKAKELREL NLSANALKTV DHSWFGPLAS ALQILDVSAN PLHCACGAAF MDFLLEVQAA 780 

VPGLPSRVKC GSPGQLQGLS IFAQDLRLCL DEALSWDCFA LSLLAVALGL GVPMLHHLCG 840 

WDLWYCFHLC LAWLPWRGRQ SGRDEDALPY DAFWFDKTQ SAVADWVYNE LRGQLEECRG 900 

RWALRLCLEE RDWLPGKTLF ENLWASVYGS RKTLFVLAHT DRVSGLLRAS FLLAQQRLLE 960 

DRKDVWLVI LSPDGRRSRY VRLRQRLCRQ SVLLWPHQPS GQRSFWAQLG MALTRDNHHF 1020 

45 YNRNFCQGPT AE 1032 



Table 13. cDNA Sequence for Murine TLR9 

(GenBank Accession No. AF348140: SEP ID NO:7) 

tgtcagaggg agcctcggga gaatcctcca tctcccaaca tggttctccg tcgaaggact 60 

50 ctgcacccct tgtccctcct ggtacaggct gcagtgctgg ctgagactct ggccctgggt 120 

accctgcctg ccttcctacc ctgtgagctg aagcctcatg gcctggtgga ctgcaattgg 180 

ctgttcctga agtctgtacc ccgtttctct gcggcagcat cctgctccaa catcacccgc 24 0 

ctctccttga tctccaaccg tatccaccac ctgcacaact ccgacttcgt ccacctgtcc 300 

aacctgcggc agctgaacct caagtggaac tgtccaccca ctggccttag ccccctgcac 360 
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ttctcttgcc acatgaccat tgagcccaga accttcctgg ctatgcgtac actggaggag 420 
ctgaacctga gctataatgg tatcaccact gtgccccgac tgcccagctc cctggtgaat 480 
ctgagcctga gccacaccaa catcctggtt ctagatgcta acagcctcgc cggcctatac 54 0 
agcctgcgcg ttctcttcat ggacgggaac tgctactaca agaacccctg cacaggagcg 600 
5 gtgaaggtga ccccaggcgc cctcctgggc ctgagcaatc tcacccatct gtctctgaag 660 
tataacaacc tcacaaaggt gccccgccaa ctgcccccca gcctggagta cctcctggtg 720 
tcctataacc tcattgtcaa gctggggcct gaagacctgg ccaatctgac ctcccttcga 780 
gtacttgatg tgggtgggaa ttgccgtcgc tgcgaccatg cccccaatcc ctgtatagaa 840 
tgtggccaaa agtccctcca cctgcaccct gagaccttcc atcacctgag ccatctggaa 900 

JO ggcctggtgc tgaaggacag ctctctccat acactgaact cttcctggtt ccaaggtctg 960 
gtcaacctct cggtgctgga cctaagcgag aactttctct atgaaagcat caaccacacc 1020 
aatgcctttc agaacctaac ccgcctgcgc aagctcaacc tgtccttcaa ttaccgcaag 1080 
aaggtatcct ttgcccgcct ccacctggca agttccttca agaacctggt gtcactgcag 114 0 
gagctgaaca tgaacggcat cttcttccgc tcgctcaaca agtacacgct cagatggctg 1200 

15 gccgatctgc ccaaactcca cactctgcat cttcaaatga acttcatcaa ccaggcacag 1260 
ctcagcatct ttggtacctt ccgagccctt cgctttgtgg acttgtcaga caatcgcatc 1320 
agtgggcctt caacgctgtc agaagccacc cctgaagagg cagatgatgc agagcaggag 1380 
gagctgttgt ctgcggatcc tcacccagct ccactgagca cccctgcttc taagaacttc 144 0 
atggacaggt gtaagaactt caagttcacc atggacctgt ctcggaacaa cctggtgact 1500 

20 atcaagccag agatgtttgt caatctctca cgcctccagt gtcttagcct gagccacaac 1560 
tccattgcac aggctgtcaa tggctctcag ttcctgccgc tgactaatct gcaggtgctg 1620 
gacctgtccc ataacaaact ggacttgtac cactggaaat cgttcagtga gctaccacag 1680 
ttgcaggccc tggacctgag ctacaacagc cagcccttta gcatgaaggg tataggccac 174 0 
aatttcagtt ttgtggccca tctgtccatg ctacacagcc ttagcctggc acacaatgac 1800 

25 attcataccc gtgtgtcctc acatctcaac agcaactcag tgaggtttct tgacttcagc 1860 
ggcaacggta tgggccgcat gtgggatgag gggggccttt atctccattt cttccaaggc 192 0 
ctgagtggcc tgctgaagct ggacctgtct caaaataacc tgcatatcct ccggccccag 1980 
aaccttgaca acctccccaa gagcctgaag ctgctgagcc tccgagacaa ctacctatct 2040 
ttctttaact ggaccagtct gtccttcctg cccaacctgg aagtcctaga cctggcaggc 2100 

30 aaccagctaa aggccctgac caatggcacc ctgcctaatg gcaccctcct ccagaaactg 2160 
gatgtcagca gcaacagtat cgtctctgtg gtcccagcct tcttcgctct ggcggtcgag 2220 
ctgaaagagg tcaacctcag ccacaacatt ctcaagacgg tggatcgctc ctggtttggg 2280 
cccattgtga tgaacctgac agttctagac gtgagaagca accctctgca ctgtgcctgt 234 0 
ggggcagcct tcgtagactt actgttggag gtgcagacca aggtgcctgg cctggctaat 2400 

35 ggtgtgaagt gtggcagccc cggccagctg cagggccgta gcatcttcgc acaggacctg 2460 
cggctgtgcc tggatgaggt cctctcttgg gactgctttg gcctttcact cttggctgtg 2520 
gccgtgggca tggtggtgcc tatactgcac catctctgcg gctgggacgt ctggtactgt 2580 
tttcatctgt gcctggcatg gctacctttg ctggcccgca gccgacgcag cgcccaagct 264 0 
ctcccctatg atgccttcgt ggtgttcgat aaggcacaga gcgcagttgc ggactgggtg 2700 

40 tataacgagc tgcgggtgcg gctggaggag cggcgcggtc gccgagccct acgcttgtgt 2760 
ctggaggacc gagattggct gcctggccag acgctcttcg agaacctctg ggcttccatc 2820 
tatgggagcc gcaagactct atttgtgctg gcccacacgg accgcgtcag tggcctcctg 2 880 
cgcaccagct tcctgctggc tcagcagcgc ctgttggaag accgcaagga cgtggtggtg 2 94 0 
ttggtgatcc tgcgtccgga tgcccaccgc tcccgctatg tgcgactgcg ccagcgtctc 3 000 

45 tgccgccaga gtgtgctctt ctggccccag cagcccaacg ggcagggggg cttctgggcc 3 060 
cagctgagta cagccctgac tagggacaac cgccacttct ataaccagaa cttctgccgg 3120 
ggacctacag cagaatagct cagagcaaca gctggaaaca gctgcatctt catgcctggt 3180 
tcccgagttg ctctgcctgc ! 3200 



50 Table 14. Amino Acid Sequence for Murine TLR9 

(GenBank Accession No, AAK29625: SEP ID NO:8) 

MVLRRRTLHP LSLLVQAAVL AETLALGTLP AFLPCELKPH GLVDCNWLFL KSVPRFSAAA 60 

SCSNITRLSL ISNRIHHLHN SDFVHLSNLR QLNLKWNCPP TGLSPLHFSC HMTIEPRTFL 120 

AMRTLEELNL SYNGITTVPR LPSSLVNLSL SHTNILVLDA NSLAGLYSLR VLFMDGNCYY 180 

55 KNPCTGAVKV TPGALLGLSN LTHLSLKYNN LTKVPRQLPP SLEYLLVSYN LIVKLGPEDL 24 0 

ANLTSLRVLD VGGNCRRCDH APNPCIECGQ KSLHLHPETF HHLSHLEGLV LKDSSLHTLN 300 

SSWFQGLVNL SVLDLSENFL YESINHTNAF QNLTRLRKLN LSFNYRKKVS FARLHLASSF 360 
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KNLVSLQELN MNGIPFRSLN KYTLRWLADL PKLHTLHLQM NFINQAQLSI FGTFRALRFV 420 

DLSDNRISGP STLSEATPEE ADDAEQEELL SADPHPAPLS TPASKNFMDR CKNFKFTMDL 4 80 

SRNNLVTIKP EMFVNLSRLQ CLSLSHNSIA QAVNGSQFLP ^TNLQVLDLS HNKLDLYHWK 54 0 

SFSELPQLQA LDLSYNSQPF SMKGIGHNFS FVAHLSMLHS JjSLAHNDIHT RVSSHLNSNS 600 

5 VRFLDFSGNG MGRMWDEGGL YLHFFQGLSG LLKLDLSQNN LHILRPQNLD NLPKSLKLLS 660 

LRDNYLSFFN WTSLSFLPNL EVLDLAGNQL KALTNGTLPN GTLLQKLDVS SNSIVSWPA 720 

FFALAVELKE VNLSHNILKT VDRSWFGPIV MNLTVLDVRS NPLHCACGAA FVDLLLEVQT 780 

KVPGLANGVK CGSPGQLQGR SIFAQDLRLC LDEVLSWDCF GLSLLAVAVG MWPILHHLC 840 

GWDVWYCFHL CLAWLPLLAR SRRSAQALPY DAFWFDKAQ SAVADWVYNE LRVRLEERRG 900 

10 RRALRLCLED RDWLPGQTLF ENLWASIYGS RKTLFVLAHT DRVSGLLRTS FLLAQQRLLE 960 

DRKDVWLVI LRPDAHRSRY VRLRQRLCRQ SVLFWPQQPN GQGGFWAQLS TALTRDNRHF 1020 

YNQNFCRGPT AE 1032 

Since NF-kB activation is central to the IL-l/TLR signal transduction pathway 
15 (Medzhitov R et ah (1998) Mol Cell 2:253-258 (1998); Muzio M et al. (1998) J Exp 
Med 187:2097-101), cells were transfected with hTLR9 or co-transfected with hTLR9 
and an NF-KB-driven luciferase reporter construct. Human 293 fibroblast cells were 
transiently transfected with (Figure 1 A) hTLR9 and a six-times NF-KB-luciferase 
reporter plasmid (NF-kB-1uc, kindly provided by Patrick Baeuerle, Munich, Germany) 
20 or (Figure IB) with hTLR9 alone. After stimulus with CpG-ODN (2006, 2^M, 

TCGTCGTTTTGTCGTTTTGTCGTT, SEQ ID NO:15^, GpC-ODN (2006-GC, 2jiM, 
TGCTGCTTTTGTGCTTTTGTGCTT, SEQ ID NO:16), LPS (100 ng/ml) or media, 
NF-kB activation by luciferase readout (8h, Figure 1 A) or IL-8 production by ELISA 
(48h, Figure IB) were monitored. Results are representative of three independent 
25 experiments. Figure 1 shows that cells expressing hTLR9 responded to CpG-DNA but 
not to LPS. 

Figure 2 demonstrates the same principle for the transfection of mTLR9. 
Human 293 fibroblast cells were transiently transfected with mTLR9 and the NF-kB- 
luc construct (Figure 2). Similar data was obtained for IL-8 production (not shown). 
30 Thus expression of TLR9 (human or mouse) in 293 cells results in a gain of function 
for CpG-DNA stimulation similar to hTLR4 reconstitution of LPS responses. 

To generate stable clones expressing human TLR9, murine TLR9, or either 
TLR9 with the NF-kB-1uc reporter plasmid, 293 cells were transfected in 10 cm plates 
(2xl0 6 cells/plate) with 16 ng of DNA and selected with 0.7 mg/ml G418 (PAA 

35 Laboratories GmbH, C6lbe, Germany). Clones were tested for TLR9 expression by 

i 

RT-PCR, for example as shown in Figure 3. The clones were also screened for IL-8 
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production or NF-KB-luciferase activity after stimulation with ODN. Four different 
types of clones were generated. 

293-hTLR9-luc: expressing human TLR9 and 6-fold NF-icB-luciferase reporter 
5 293-mTLR9-luc: expressing murine TLR9 and 6-fold NF-KB-luciferase reporter 
293-hTLR9 : expressing human TLR9 } 

293-mTLR9: expressing murine TLR9 

Figure 4 demonstrates the responsiveness of a stable 293-hTLR9-luc clone after 
10 stimulation with CpG-ODN (2006, 2\xM), GpC-ODN (2006-GC, 2nM), Me-CpG-ODN 
(2006 methylated, 2jiM; TZGTZGTTTTGTZGTTTTGTZGTT, Z - 5-methylcytidine, 
SEQ ID NO: 17), LPS (100 ng/ml) or media, as measured by monitoring NF-kB 
activation. Similar results were obtained utilizing IL-8 production with the stable clone 
293-hTLR9. 293-mTLR9-luc were also stimulated with CpG-ODN (1668, 2\iM; 
15 TCCATGACGTTCCTGATGCT, SEQ ID NO: 18), GpC-ODN (1668-GC, 2jiM; 

TCCATGAGCTTCCTGATGCT, SEQ ID NO: 1 9), Me-CpG-ODN (1 668 methylated, 
2\iM; TCCATGAZGTTCCTGATGCT, Z = 5-methylcytidine, SEQ ID NO:20), LPS 
(100 ng/ml) or media, as measured by monitoring NF-kB activation (Figure 5). 
Similar results were obtained utilizing IL-8 production with the stable clone 293- 

20 mTLR9. Results are representative of at least two independent experiments. These 

i 

results demonstrate that CpG-DNA non-responsive celljlines can be stably genetically 
complemented with TLR9 to become responsive to CpG-DNA in a motif-specific 
manner. These cells can be used for screening of optimal ligands for innate immune 
responses driven by TLR9 in multiple species. 

25 

Example 11. Reconstitution of TLR3 Signaling in 293 Fibroblasts 

Human TLR3 cDNA and murine TLR3 cDNA in pT-Adv vector (from 
Clonetech) were individually cloned into the expression vector pcDNA3.1(-) from 
Invitrogen using the EcoRI site. The resulting expression vectors mentioned above 
30 were transfected into CpG-DNA non-responsive human 293 fibroblast cells (ATCC, 
CRL-1 573) using the calcium phosphate method. Utilizing a "gain of function" assay it 
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was possible to reconstitute human TLR3 (hTLR3) and murine TLR3 (mTLR3) 
signaling in 293 fibroblast cells. 

Since NF-kB activation is central to the IL-l/TLR signal transduction pathway 
(Medzhitov R et al. (1998) Mol Cell 2:253-8; Muzio M et al. (1998) J Exp Med 
5 1 87:2097-101), in a first set of experiments human 293 fibroblast cells were transfected 
with hTLR3 alone or co-transfected with hTLR3 and an NF-KB-driven luciferase 
reporter construct. 

Likewise, in a second set of experiments, 293 fibroblast cells were transfected 
with hTLR3 alone or co-transfected with hTLR3 and an EFN-a4-driven luciferase 
JO reporter construct (described in Example 2 above). 

In a third group of experiments, 293 fibroblast cells were transfected with 
hTLR3 alone or co-transfected with hTLR3 and a RANTES-driven luciferase reporter 
construct (described in Example 5 above). 

15 Example 12. Proline to Histidine Mutation P91SH in the TIR Domain of Human 
and MurineTLR9 Alters TLR9 Signaling 

Toll-like receptors have a cytoplasmic Toll/IL-1 receptor (TIR) homology 
domain which initiates signaling after binding of the adapter molecule MyD88. 
Medzhitov R et al. (1998) Mol Cell 2:253-8; Kopp EB et al. (1999) Curr Opin Immunol 

20 1 1:15-8. Reports by others have shown that a single point mutation in the signaling 
TIR domain in murine TLR4 (Pro712 to His, P712H) or human TLR2 (Pro681 to His, 
P681H) abolishes host immune response to lipopolysaccharide or gram-positive 
bacteria, respectively. Poltorak A et al. (1998) Science 282:2085-8; Underhill DM et 
al. (1999) Nature 401:81 1-5. Through site-specific mutagenesis the equivalent proline 

25 (P) at position 915 of human TLR9 and murine TLR9 were mutated to histidine (H; 
P915H). These mutations were generated by the use of the primers 
S'-GCGACTGGCTGCATGGCAAAACCCTCTTTG-S' (SEQ ID NO:21) and 
S'-CAAAGAGGGTTTTGCCATGCAGCCAGTCGC-S' (SEQ ID NO:22) for human 
TLR9 and the primers S'-CGAGATrGGCTGCATGCCCAGACGCTCTTC-S 1 (SEQ 

30 ID NO:23) and 5 f -GAAGAGCGTCTGGCCATGCAGCCAATCTCG-3 ! (SEQ ID 
NO:24) for murine TLR9. Expression vectors for the mutant TLR9s, hTLR9-P915H 
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and mTLR9-P91 5H, were constructed and verified using standard recombinant DNA 
techniques. 

For the stimulation of human TLR9 variant, hTLR9-P915H, 293 cells were 
transiently transfected with expression vector for hTLR9 or hTLR9-P915H and 
5 stimulated after 16 hours with ODN 2006 or ODN 1668 at various concentrations. 
Likewise for the stimulation of murine TLR9 variant, mTLR9-P915H, 293 cells were 
transiently transfected with expression vector for mTLR9 or mTLR9-P915H and 
stimulated after 16 hours with ODN 2006 or ODN 1668 at various concentrations. 
After 48 hours of stimulation, supernatant was harvested and IL-8 production was 
10 measured by ELIS A. Results demonstrated that TLR9 activity can be destroyed by the 
P915H mutation in the TIR domain of both human and murine TLR9. 

Example 13. Exchange of the TIR Domain Between Human TLR3 and Human 
TLR9 (hTLR3-TIR9 and hTLR9-TIR3) 

15 While TLR3 and TLR9 share many structural features, TLR3, by virtue of its 

having an alanine rather than proline at a critical position in the TIR domain, may not 
be able to signal via MyD88 as does TLR9. The chimeric TLRs described here can be 
used in the screening assays of the invention. To generate molecules consisting of 
human extracellular TLR3 and the TIR domain of human TLRS (hTLR3-TIR9), the 

20 following approach can be used. Through site-specific mutagenesis a Clal restriction 
site is introduced in human TLR3 and human TLR9. For human TLR9 the DNA 
sequence 5'-GGCCTCAGCATCTTT-3 , (3026-3040, SEQ ID NO:25) is mutated to 5»- 
GGCCTAICGMTTTT-3' (SEQ ID NO:26), introducing a Clal site (underlined in the 
sequence) but leaving the amino acid sequence (GLSIF, aa 798-802) unchanged. For 

25 human TLR3 the DNA sequence S'-GGGTTCCCAGTGAGA-T (21 12-2126, SEQ ID 
NO:27) is mutated to 5'-GGGT TATCGATT AGA-3' (SEQ ID NO:28), introducing a 
Clal site and creating the amino acid sequence (GLSIR, aa 685-689) which differs in 
three positions (aa 686, 687, 688) from the wildtype human TLR3 sequence (GFPVR, 
aa 685-689). 

30 hTLR3-TIR9. The primers used for human TLR9 are 

5'-CAGCTCCAGGGCCTATCGATTTTTGCACAGGACC-3' (SEQ ID NO:29) and 
5-GGTCCTGTGCAAAAATCGATAGGCCCTGGAGCTG-3' (SEQ ID NO:30). For 
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creating an expression vector containing the extracellular portion of human TLR3 
connected to the TIR domain of human TLR9, the human TLR3 expression vector is 
cut with Clal and limiting amounts of EcoRI and the fragment coding for the TIR 
domain of human TLR9 generated by a Clal and EcoRI digestion of human TLR9 
5 expression vector is ligated in the vector fragment containing the extracellular portion 
of hTLR3. Transfection into E.coli yields the expression vector hTLR3-TIR9 (human 
extracellular TLR3-human TLR9 TIR domain). The expressed product of hTLR3- 
TIR9 can interact with TLR3 ligands and also signal through an MyD88-mediated 
signal transduction pathway. 

10 hTLR9-TIR3. A fusion construct with the extracellular domain of hTLR9 and 

the TIR domain of hTLR3 is prepared using an analogous strategy. For creating an 
expression vector containing the extracellular portion of human TLR9 connected to the 
TIR domain of human TLR3, the human TLR9 expression vector is cut with Clal and 
limiting amounts of EcoRI and the fragment coding for the TIR domain of human 

75 TLR3 generated by a Clal and EcoRI digestion of human TLR3 expression vector is 
ligated in the vector fragment containing the extracellular portion of hTLR9. 
Transfection into E.coli yields the expression vector hTLR9-TIR3 (human extracellular 
TLR9-human TLR3 TIR domain). The expressed product of hTLR9-TIR3 can interact 
with TLR9 ligands, e.g., CpG DNA, and signal through a signal transduction pathway 

20 in a manner like TLR3. 



Example 14. Sensitive in vitro Assay for Detecting Ligand Affinity Differences for 
aTLR 

Human 293 fibroblast cells stably transfected with murine TLR9 and an NF-kB- 
25 luciferase reporter were stimulated for 16 hours with the following fully 
phosphorothioated oligodeoxynucleotides (ODN): 

5890: T*C*C*A*T*G*A*C*G*T*T*T*T*T*G*A*T*G*T*T (SEQ ID NO:31) 

5895: T*C*C*A*T*G*A*C*G*T*T*T*T*T*G*A*T*G (SEQ ID NO:32) 

30 5896: T*C*C*A*T*G*A*C*G*T*T*T*T*T*G*A (SEQIDNO:33) 
5897: T*C'C*A*T*G*A*C*G*T*T*T*T*T (SEQIDNO:34) 

Concentration of the stimulus was titrated between 10 and 2 nM. The data is 
plotted in Figure 6 as fold induction of NF-kB luciferase, relative to unstimulated 
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background, versus ODN concentration. The data displays typical first-order binding 
from which EC50 or maximal activity can be determined. EC50 is defined as the 
concentration of the ligand stimulus that results in 50% maximal activation. As shown 
in the figure, the EC50 ranges from 42 nM for ODN 5890 to 1220 nM for ODN 5897. 
5 The assay demonstrates sensitive differentiation between subtle changes in ligand. 



10 



Example 15. Influence of Assay Kinetics on TLR Screening Assays 

Curves were prepared as in the previous Example 14 with the following ODN 
ligands, where * indicates phosphrothioate and _ indicates phosophodiester linkage: 



5890: T*C*C*A*T*G*A*C*G*T*T*T*T*T*G*A*T*G*T*T (SEQ ID NO:35) 

5497: T*C*G*T*C*G*T*T*T*T_G_T_C_G_T*T*T*T*G*T*C*G*T*T (SEQ ID NO:36) 

5746: T*C_G*T*C_G*T*T*T*T_G*T ,|, C_G*T*T*T*T*G*T*C_G*T*T (SEQ ID NO:37) 

2006: T*C*G*T*C*G*T*T*T*T*G*T*C*G*T*T*T*T*G*T*C*G*T*T (SEQ ID NO: 15) 

15 5902: T*C*C*A*T*G*A*C_G_T*T*T*T*T*G*A*T_G*T*T (SEQ ID NO:38) 

A family of stimulation curves was determined at various times of assay incubation 
between 1 and 24 hours. The EC50 was determined for each ligand at each time point. 
The EC50 was then plotted versus time to yield the resultant curves shown in Figure 7. 

20 As evident from Figure 7, it is demonstrated that the kinetics of activation vary 

dependent on the ligand tested. Because luciferase has a three-hour half-life, the signal 
is transient and requires constant promoter-driven activation to be maintained. The 
maintenance is directly related to the signal delivered by the ligand/receptor complex. 
Thus analysis of time kinetics in such a fashion allows one to determine both affinity of 

25 ligand/receptor interaction and the availability of the ligand to the receptor through 
time. The principle is demonstrated as follows. The ODN 5890 is of higher affinity 
compared to the ODN 2006. When the ligand is made more labile to destruction by 
incorporating less stable diester linkages, the activity .curves turn upward with time 
such as for ODN 5746, 5902 and 5497. 

30 In the context of a screening assay for TLR/ligand interactions, limiting the 

assay to one time point would bias the assay. At 24 hours it would appear that only 
ODN 2006 and 5890 were ligand candidates, however this is clearly not the case. The 
assay also demonstrates that earlier time points, such as 6 hours in this example, would 
be the optimal time point for determining the greatest difference between 
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receptor/ligand affinities. Thus optimization of the screening assay can be adjusted 
depending on the desired information to be obtained from the screen, e.g., higher 
affinity of interaction versus stability and duration of receptor/ligand interaction. 

Figure 8 demonstrates the same principles shown with a murine TLR as in this 
5 example can be applied independent of the TLR utilized. For this set of data a 293 cell 
stably transfected with human TLR9 and NF-KB-luciferase was used. 

Example 16. Influence of Assay Kinetics on Maximal Activities in TLR Screening 
Assays 

10 Data was collected as in the previous Example 1 5, however the maximal 

activity (maximal fold induction) was plotted versus time in Figures 9 and 10. Such 
data analysis results in a prediction of biological efficacy. As can be seen from these 
figures, the lower affinity ODN, e.g., ODN 2006 and 5890 as demonstrated by the 
EC50 curves of Example 15, are clearly less efficient at delivering high activity. 

15 

Example 17. Differential Outcomes of TLR Screening Assays Dependent on 
Promoter Utilization 

Human 293 fibroblast cells were transiently transfected with expression vector 
for TLR 7, TLR8, or TLR9 and one of the following reporter constructs bearing the 

20 following promoters driving the luciferase gene: NF-kB-1uc, IP-10-luc, RANTES-luc, 
ISRE-luc, and IL-8-luc. The cells were stimulated for 16h with the maximal activity 
concentration of specific ligand. TLR9 was stimulated with CpG ODN 2006; TLR8 
and TLR7 were stimulated with the imidazolquinalone R848. Results are shown in 
Figure 11. As evident from the figure, the promoter used influences the outcome of 

25 the screening assay dependent on the TLR in question. For example, NF-kB is a 

reliable marker for all TLRs tested, whereas in this set of experiments ISRE was only 
functional to some extent for TLR8. The IL-8 promoter is particularly sensitive for 
TLR7 or TLR8 screening assays but would be much less efficient in TLR9 assays. 

30 What is claimed is: ; 
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Claims 

1 . A screening method for identifying an immunostimulatory compound, 
comprising: 

5 contacting a functional TLR3 with a test compound under conditions which, in 

absence of the test compound, permit a negative control response mediated by a 

i 

TLR3 signal transduction pathway; 

detecting a test response mediated by the TLR3 signal transduction pathway; 

and 

JO determining the test compound is an immunostimulatory compound when the 

test response exceeds the negative control response. 

2. A screening method for identifying an immunostimulatory compound, 
comprising: 

15 contacting a functional TLR3 with a test compound under conditions which, in 

presence of a reference immunostimulatory compound, permit a reference response 
mediated by a TLR3 signal transduction pathway; 

detecting a test response mediated by the TLR3 signal transduction pathway; 

and 

20 determining the test compound is an immunostimulatory compound when the 

test response equals or exceeds the reference response. 

A screening method for identifying a compound that modulates TLR3 signaling 
activity, comprising: 

contacting a functional TLR3 with a test compound and a reference 
immunostimulatory compound under conditions which, in presence of the reference 
immunostimulatory compound alone, permit a reference response mediated by a 
TLR3 signal transduction pathway; 

detecting a test-reference response mediated by the TLR3 signal transduction 
pathway; 

determining the test compound is an agonist of TLR3 signaling activity when 
the test-reference response exceeds the reference response; and 
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determining the test compound is an antagonist of TLR3 signaling activity when 
the reference response exceeds the test-reference response. 

4. A screening method for identifying species specificity of an immunostimulatory 
5 compound, comprising: 

measuring a first species-specific response mediated by a TLR3 signal 
transduction pathway when a functional TLR3 of a first species is contacted with a 
test compound; 

measuring a second species-specific response mediated by the TLR3 signal 
10 transduction pathway when a functional TLR3 of a second species is contacted with 
the test compound; and 

comparing the first species-specific response with the second species-specific 
response. 

15 5. The method of any one of claims 1-4, wherein the screening method is 
performed on a plurality of test compounds. 

6. The method of claim 5, wherein the response mediated by the TLR3 signal 
transduction pathway is measured quantitatively. 

20 

7. The method of any one of claims 1-4, wherein tt^e functional TLR3 is expressed 
in a cell. 

8. The method of claim 7, wherein the cell is an isolated mammalian cell that 
25 naturally expresses the functional TLR3. 

9. The method of claim 7, wherein the cell is an isolated mammalian cell that does 
not naturally express the functional TLR3, and wherein the cell comprises an 
expression vector for TLR3 . 

30 

10. The method of claim 9, wherein the cell is a 293 human fibroblast. 
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1 1 . The method of claim 7, wherein the cell comprises an expression vector 
comprising an isolated nucleic acid which encodes a reporter construct selected 
from the group of interleukin-6-luciferase (IL-6-luc), IL-8-luc, IL-12 p40-luc, IL-12 
p40-p-Gal, NF-kB-1uc, APl-luc, IFN-a-luc, IFN-p-luc, RANTES-luc, TNF-luc, IP- 

5 10-luc, I-TAC-luc, and ISRE-luc. 

1 2. The method of claim 1 1 , wherein the reporter construct is ISRE-luc. 

13. The method of any one of claims 1-4, wherein the functional TLR3 is part of a 
10 cell-free system. 

14. The method of any one of claims 1-4, wherein the functional TLR3 is part of a 
complex with a non-TLR protein selected from the group consisting of MyD88, IL- 
1 receptor associated kinase 1-3 (IRAKI, IRAK2, IRAK3), tumor necrosis factor 

15 receptor-associated factor 1-6 (TRAF1 - TRAF6), IkB, NF-kB, MyD88-adapter- 
like (Mai), Toll-interleukin 1 receptor (TIR) domain-containing adapter protein 
(T1RAP), Tollip, Rac, and functional homologues and derivatives thereof. 

15. The method of claim 14, wherein the non-TLR protein excludes MyD88. 

20 

16. The method of claim 2 or 3, wherein the reference immunostimulatory 
compound is a nucleic acid. 

17. The method of claim 16, wherein the nucleic acid is a CpG nucleic acid. 

25 

18. The method of claim 2 or 3, wherein the reference immunostimulatory 
compound is a small molecule. 

19. The method of any one of claims 1-4, wherein the test compound is a part of a 
30 combinatorial library of compounds. 
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20. The method of any one of claims 1-4, wherein the test compound is a nucleic 
acid. 

21. The method of claim 20, wherein the nucleic acid is a CpG nucleic acid. 

5 

22. The method of any one of claims 1-4, wherein the test compound is a small 
molecule. 

23. The method of any one of claims 1-4, wherein the test compound is a 
JO polypeptide. 

24. The method of any one of claims 1-4, wherein the response mediated by a 
TLR3 signal transduction pathway is induction of a reporter gene under control of a 
promoter response element selected from the group consisting of ISRE, IL-6, IL-8, 

15 IL-12 p40, DFN-ct, IFN-p, IFN-co, RANTES, TNF, EP-10, and I-TAC. 

25. The method of claim 24, wherein the reporter gene under control of a promoter 
response element is selected from the group consisting of ISRE-luc, IL-6-luc, IL-8- 
luc, IL-12 p40-luc, IL-12 p40-p-Gal, IFN-a-luc, IFN-P-luc, RANTES-luc, TNF- 

20 luc, IP -1 0-luc, and I-TAC-luc. 

26. The method of claim 25, wherein the reporter gene under control of a promoter 
response element is ISRE-luc. 

25 27. The method of claim 24, wherein the reporter gene is selected from the group 
consisting of IFN-al-luc and IFN-a4-luc. 

28. The method of any one of claims 1-4, wherein the response mediated by a 
TLR3 signal transduction pathway is selected from the group consisting of (a) 
30 induction of a reporter gene under control of a minimal promoter responsive to a 
transcription factor selected from the group consisting of API, NF-kB, ATF2, 
IRF3, and IRF7; (b) secretion of a chemokine; and (c) secretion of a cytokine. 



-46- 



WO 03/031573 



PCTYUS02/31460 



29. The method of claim 28, wherein the response mediated by a TLR3 signal 
transduction pathway is induction of a reporter gene selected from the group 
consisting of APl-luc and NF-kB-Iuc. 

5 

30. The method of claim 28, wherein the response mediated by a TLR3 signal 
transduction pathway is secretion of a type 1 EFN. 

31 . The method of claim 28, wherein the response mediated by a TLR3 signal 
10 transduction pathway is secretion of a chemokine selected from the group 

consisting of CCL5 (RANTES), CXCL9 (Mig), CXCL10 (HMO), and CXCL1 1 (I- 
TAC). 

32. The method of any one of claims 1-3, wherein the contacting a functional TLR3 
15 with a test compound further comprises, for each test compound, contacting with 

the test compound at each of a plurality of concentrations. 

33. The method of any one of claims 1 -3, wherein the detecting is performed 6- 1 2 
hours following the contacting. 

20 

34. The method of any one of claims 1-3, wherein the detecting is performed 16-24 
hours following the contacting. 
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SEQUENCE LISTING 

<110> Coley Pharmaceutical GmbH 

<120> TOLL-LIKE RECEPTOR 3 SIGNALING AGONISTS AND ANTAGONISTS 

<130> C01041. 70031 

<160> 117 

<170> Patentln version 3.1 

<210> 1 

<211> 3029 

<212> DNA 

<213> Homo sapiens 

<400> 1 



gcggccgcgt 


cgacgaaatg 


tctggatttg 


gactaaagaa 


aaaaggaaag 


gctagcagtc 


60 


atccaacaga 


atcatgagac 


agactttgcc 


ttgtatctac 


ttttgggggg 


gccttttgcc 


120 


ctttgggatg 


ctgtgtgcat 


cctccaccac 


caagtgcact 


gttagccatg 


aagttgctga 


180 


ctgcagccac 


ctgaagttga 


ctcaggtacc 


cgatgatcta 


cccacaaaca 


taacagtgtt 


240 


gaaccttacc 


cataatcaac 


tcagaagatt 


accagccgcc 


aacttcacaa 


ggtatagcca 


300 


gctaactagc 


ttggatgtag 


gatttaacac 


catctcaaaa 


ctggagccag 


aattgtgcca 


360 


gaaacttccc 


atgttaaaag 


ttttgaacct 


ccagcacaat 


gagctatctc 


aactttctga 


420 


taaaaccttt 


gccttctgca 


cgaatttgac 


tgaactccat 


ctcatgtcca 


actcaatcca 


480 


gaaaattaaa 


aataatccct 


ttgtcaagca 


gaagaattta 


atcacattag 


atctgtctca 


540 


taatggcttg 


tcatctacaa 


aattaggaac 


tcaggttcag 


ctggaaaatc 


tccaagagct 


600 


tctattatca 


aacaataaaa 


ttcaagcgct 


aaaaagtgaa 


gaactggata 


tctttgccaa 


660 


ttcatcttta 


aaaaaattag 


agttgtcatc 


gaatcaaatt 


aaagagtttt 


ctccagggtg 


720 


ttttcacgca 


attggaagat 


tatttggcct 


ctttctgaac 


aatgtccagc 


tgggtcccag 


780 


ccttacagag 


aagctatgtt 


tggaattagc 


aaacacaagc 


attcggaatc 


tgtctctgag 


840 


taacagccag 


ctgtccacca 


ccagcaatac 


aactttcttg 


ggactaaagt 


ggacaaatct 


900 


cactatgctc 


gatctttcct 


acaacaactt 


aaatgtggtt 


ggtaacgatt 


cctttgcttg 


960 


gcttccacaa 


ctagaatatt 


tcttcctaga 


gtataataat 


atacagcatt 


tgttttctca 


1020 


ctctttgcac 


gggcttttca 


atgtgaggta 


cctgaatttg 


aaacggtctt 


ttactaaaca 


1080 


aagtatttcc 


cttgcctcac 


tccccaagat 


tgatgatttt 


tcttttcagt 


ggctaaaatg 


1140 


tttggagcac 


cttaacatgg 


aagataatga 


tattccaggc 


ataaaaagca 


atatgttcac 


1200 


aggattgata 


aacctgaaat 


acttaagtct 


atccaactcc 


tttacaagtt 


tgcgaacttt 


1260 
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gacaaatgaa 


acatttgtat 


cacttgctca 


ttctccctta 


cacatactca 


acctaaccaa 


1320 


gaataaaatc 


tcaaaaatag 


agagtgatgc 


tttctcttgg 


ttgggccacc 


tagaagtact 


1380 


tgacctgggc 


cttaatgaaa 


ttgggcaaga 


actcacaggc 


caggaatgga 

.1 


gaggtctaga 


1440 


aaatattttc 


gaaatctatc 


tttcctacaa 


caagtacctg 


cagctgacta 


ggaactcctt 


1500 


tgccttggtc 


ccaagccttc 


aacgactgat 


gctccgaagg 


gtggccctta 


aaaatgtgga 


1560 


tagctctcct 


tcaccattcc 


agcctcttcg 


taacttgacc 


attctggatc 


taagcaacaa 


1620 


caacatagcc 


aacataaatg 


atgacatgtt 


ggagggtctt 


gagaaactag 


aaattctcga 


1680 


tttgcagcat 


aacaacttag 


cacggctctg 


gaaacacgca 


aaccctggtg 


gtcccattta 


1740 


tttcctaaag 


ggtctgtctc 


acctccacat 


ccttaacttg 


gagtccaacg 


gctttgacga 


1800 


gatcccagtt 


gaggtcttca 


aggatttatt 


tgaactaaag 


atcatcgatt 


taggattgaa 


1860 


taatttaaac 


acacttccag 


catctgtctt 


taataatcag 


gtgtctctaa 


agtcattgaa 


1920 


ccttcagaag 


aatctcataa 


catccgttga 


gaagaaggtt 


ttcgggccag 


ctttcaggaa 


1980 


cctgactgag 


ttagatatgc 


gctttaatcc 


ctttgattgc 


acgtgtgaaa 


gtattgcctg 


2040 


gtttgttaat 


tggattaacg 


agacccatac 


caacatccct 


gagctgtcaa 


gccactacct 


2100 


ttgcaacact 


ccacctcact 


atcatgggtt 


cccagtgaga 


ctttttgata 


catcatcttg 


2160 


caaagacagt 


gccccctttg 


aactcttttt 


catgatcaat 


accagtatcc 


tgttgatttt 


2220 


tatctttatt 


gtacttctca 


tccactttga 


gggctggagg 


atatcttttt 


attggaatgt 


2280 


ttcagtacat 


cgagttcttg 


gtttcaaaga 


aatagacaga 


cagacagaac 


agtttgaata 


2340 


tgcagcatat 


ataattcatg 


cctataaaga 


taaggattgg 


gtctgggaac 


atttctcttc 


2400 


aatggaaaag 


gaagaccaat 


ctctcaaatt 


ttgtctggaa 


gaaagggact 


ttgaggcggg 


2460 


tgtttttgaa 


ctagaagcaa 


ttgttaacag 


catcaaaaga 


agcagaaaaa 


ttatttttgt 


2520 


tataacacac 


catctattaa 


aagacccatt 


atgcaaaaga 


ttcaaggtac 


atcatgcagt 


2580 


tcaacaagct 


attgaacaaa 


atctggattc 


cattatattg 


gttttccttg 


aggagattcc 


2640 


agattataaa 


ctgaaccatg 


cactctgttt 


gcgaagagga 


atgtttaaat 


ctcactgcat 


2700 


cttgaactgg 


ccagttcaga 


aagaacggat 


aggtgccttt 


cgtcataaat 


tgcaagtagc 


2760 


acttggatcc 


aaaaactctg 


tacattaaat 


ttatttaaat 


attcaattag 


caaaggagaa 


2820 


actttctcaa 


tttaaaaagt 


tctatggcaa 


atttaagttt 


tccataaagg 


tgttataatt 


2880 


tgtttattca 


tatttgtaaa 


tgattatatt 


ctatcacaat 


tacatctctt 


ctaggaaaat 


2940 


gtgtctcctt 


atttcaggcc 


tatttttgac 


aattgactta 


attttaccca 


aaataaaaca 


3000 
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tataagcacg caaaaaaaaa aaaaaaaaa 3 029 



<210> 2 

<211> 904 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Met Arg Gin Thr Leu Pro Cys lie Tyr Phe Trp Gly Gly Leu Leu Pro 
15 10 15 



Phe Gly Met Leu Cys Ala Ser Ser Thr Thr Lys Cys Thr Val Ser His 
20 25 30 



Glu Val Ala Asp Cys Ser His Leu Lys Leu Thr Gin Val Pro Asp Asp 
35 40 45 



Leu Pro Thr Asn lie Thr Val Leu Asn Leu Thr His Asn Gin Leu Arg 
50 55 60 



Arg Leu Pro Ala Ala Asn Phe Thr Arg Tyr Ser Gin Leu Thr Ser Leu 
65 70 75 80 



Asp Val Gly Phe Asn Thr lie Ser Lys Leu Glu Pro Glu Leu Cys Gin 
85 90 95 



Lys Leu Pro Met Leu Lys Val Leu Asn Leu Gin His Asn Glu Leu Ser 
100 105 110 



Gin Leu Ser Asp Lys Thr Phe Ala Phe Cys Thr Asn Leu Thr Glu Leu 
115 120 125 



His Leu Met Ser Asn Ser lie Gin Lys lie Lys Asn Asn Pro Phe Val 
130 135 140 



Lys Gin Lys Asn Leu lie Thr Leu Asp Leu Ser His Asn Gly Leu Ser 
145 150 155 160 



Ser Thr Lys Leu Gly Thr Gin Val Gin Leu Glu Asn Leu Gin Glu Leu 
165 170 175 



Leu Leu Ser Asn Asn Lys lie Gin Ala Leu Lys Ser Glu Glu Leu Asp 
180 185 190 



He Phe Ala Asn Ser Ser Leu Lys Lys Leu Glu Leu Ser Ser Asn Gin 
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195 200 205 



lie Lys Glu Phe Ser Pro Gly Cys Phe His Ala lie Gly Arg Leu Phe 
210 215 220 



Gly Leu Phe Leu Asn A9n Val Gin Leu Gly Pro Ser Leu Thr Glu Lys 
225 230 235 240 



Leu Cys Leu Glu Leu Ala Asn Thr Ser lie Arg Asn Leu Ser Leu Ser 
245 250 255 



Asn Ser Gin Leu Ser Thr Thr Ser Asn Thr Thr Phe Leu Gly Leu Lys 
260 265 270 



Trp Thr Asn Leu Thr Met Leu Asp Leu Ser Tyr Asn Asn Leu Asn Val 
275 280 285 



Val Gly Asn Asp Ser Phe Ala Trp Leu Pro Gin Leu Glu Tyr Phe Phe 
290 295 300 



Leu Glu Tyr Asn Asn lie Gin His Leu Phe Ser His Ser Leu His Gly 
305 310 315 320 



Leu Phe Asn Val Arg Tyr Leu Asn Leu Lys Arg Ser Phe Thr Lys Gin 
325 330 335 



Ser He Ser Leu Ala Ser Leu Pro Lys He Asp Asp Phe Ser Phe Gin 
340 345 350 



Trp Leu Lys Cys Leu Glu His Leu Asn Met Glu Asp Asn Asp He Pro 
355 360 365 



Gly He Lys Ser Asn Met Phe Thr Gly Leu He Asn Leu Lys Tyr Leu 
370 375 380 



Ser Leu Ser Asn Ser Phe Thr Ser Leu Arg Thr Leu Thr Asn Glu Thr 
385 390 395 400 



Phe Val Ser Leu Ala His Ser Pro Leu His He Leu Asn Leu Thr Lys 
405 410 ' 415 



Asn Lys He Ser Lys He Glu Ser Asp Ala Phe Ser Trp Leu Gly His 
420 425 430 
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Leu Glu Val Leu Asp Leu Gly Leu Asn Glu He Gly Gin Glu Leu Thr 
435 440 445 



Gly Gin Glu Trp Arg Gly Leu Glu Asn He Phe Glu He Tyr Leu Ser 
450 455 460 



Tyr Asn Lys Tyr Leu Gin Leu Thr Arg Asn Ser Phe Ala Leu Val Pro 
465 * 470 475 480 



Ser Leu Gin Arg Leu Met Leu Arg Arg Val Ala Leu Lys Asn Val Asp 
485 490 495 



Ser Ser Pro Ser Pro Phe Gin Pro Leu Arg Asn Leu Thr He Leu Asp 
500 505 " 510 



Leu Ser Asn Asn Asn He Ala Asn He Asn Asp Asp Met Leu Glu Gly 
515 520 525 



Leu Glu Lys Leu Glu He Leu Asp Leu Gin His Asn Asn Leu Ala Arg 
530 535 540 



Leu Trp Lys His Ala Asn Pro Gly Gly Pro He Tyr Phe Leu Lys Gly 
545 550 555 560 



Leu Ser His Leu His He Leu Asn Leu Glu Ser Asn Gly Phe Asp Glu 
565 570 575 



He Pro Val Glu Val Phe Lys Asp Leu Phe Glu Leu Lys He He Asp 
580 585 590 



Leu Gly Leu Asn Asn Leu Asn Thr Leu Pro Ala Ser Val Phe Asn Asn 
595 600 605 



Gin Val Ser Leu Lys Ser Leu Asn Leu Gin Lys Asn Leu He Thr Ser 
610 615 620 



Val Glu Lys Lys Val Phe Gly Pro Ala Phe Arg Asn Leu Thr Glu Leu 
625 630 635 640 



Asp Met Arg Phe Asn Pro Phe Asp Cys Thr Cys Glu Ser He Ala Trp 
645 650 655 



Phe Val Asn Trp He Asn Glu Thr His Thr Asn He Pro Glu Leu Ser 
660 665 670 
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Ser His Tyr Leu Cys Asn Thr Pro Pro His Tyr His Gly Phe Pro Val 
675 680 685 



Arg Leu Phe Asp Thr Ser Ser Cys Lys Asp Ser Ala Pro Phe Glu Leu 
690 695 700 



Phe Phe Met He Asn Thr Ser He Leu Leu He Phe He Phe He Val 
705 710 715 720 



Leu Leu He His Phe Glu Gly Trp Arg He Ser Phe Tyr Trp Asn Val 
725 730 735 



Ser Val His Arg Val Leu Gly Phe Lys Glu He Asp Arg Gin Thr Glu 
740 745 750 



Gin Phe Glu Tyr Ala Ala Tyr He He His Ala Tyr Lys Asp Lys Asp 
755 760 765 



Trp Val Trp Glu His Phe Ser Ser Met Glu Lys Glu Asp Gin Ser Leu 
770 775 780 



Lys Phe Cys Leu Glu Glu Arg Asp Phe Glu Ala Gly Val Phe Glu Leu 
785 790 795 800 



Glu Ala He Val Asn Ser He Lys Arg Ser Arg Lys lie lie Phe Val 
805 810 815 



He Thr His His Leu Leu Lys Asp Pro Leu Cys Lys Arg Phe Lys Val 
820 825 830 



His His Ala Val Gin Gin Ala lie Glu Gin Asn Leu Asp Ser He lie 
835 840 845 



Leu Val Phe Leu Glu Glu lie Pro Asp Tyr Lys Leu Asn His Ala Leu 
850 855 860 



Cys Leu Arg Arg Gly Met Phe Lys Ser His Cys lie Leu Asn Trp Pro 
865 870 875 880 



Val Gin Lys Glu Arg He Gly Ala Phe Arg His Lys Leu Gin Val Ala 
885 890 895 



Leu Gly Ser Lys Asn Ser Val His 
900 
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<210> 3 

<211> 3310 

<212> DNA 

<213> Mus musculus 

<400> 3 



tagaatatga 


tacagggatt 


gcacccataa 


tctgggctga 


atcatgaaag 


ggtgttcctc 


60 


ttatctaatg 


tactcctttg 


ggggactttt gtccctatgg attcttctgg 


tgtcttccac 


120 


aaaccaatgc 


actgtgagat 


aLaaLy Lay l. 


tgactgcagc 


catttgaagc 


taacacacat 


180 


acctgatgat 


cttccctcta 




gttgaatctt 


actcacaacc 


aactcagaag 


240 


attaccacct 


accaacttta 




ccaacttgct 


atcttggatg 


caggatttaa 


300 


ctccatttca 


aaactggagc 


caaaactata 


ccaaatactc 


cctttgttga 


aagtattgaa 


360 


cctgcaacat 


aatgagctct 


ctcaaatttc 


tgatcaaacc 


tttgtcttct 


gcacgaacct 


420 


gacagaactc 


gatctaatgt 


ctaactcaat 


acacaaaatt 


aaaagcaacc 


ctttcaaaaa 


480 


ccagaagaat 


ctaatcaaat 


tagatttgtc 


tcataatggt 


ttatcatcta 


caaagttggg 


540 


aacgggggtc 


caactggaga 


acctccaaga 


actgctctta gcaaaaaata 


aaatccttgc 


600 


gttgcgaagt 


gaagaacttg 


agtttcttgg 


caattcttct 


ttacgaaagt 


tggacttgtc 


660 


atcaaatcca 


cttaaagagt 


tctccccggg 


gtgtttccag 


acaattggca 


agttattcgc 


720 


cctcctcttg 


aacaacgccc 


aactgaaccc 


ccacctcaca 


gagaagcttt 


gctgggaact 


780 


ttcaaacaca 


agcatccaga 


atctctctct 


ggctaacaac 


cagctgctgg 


ccaccagcga 


840 


gagcactttc 


tctgggctga 


agtggacaaa 


tctcacccag 


ctcgatcttt 


cctacaacaa 


900 


cctccatgat 


gtcggcaacg 


gttccttctc 


ctatctccca agcctgaggt 


atctgtctct 


960 


ggagtacaac 


aatatacagc 


gtctgtcccc 


tcgctctttt 


tatggactct 


ccaacctgag 


1020 


gtacctgagt 


ttgaagcgag 


catttactaa gcaaagtgtt 


tcacttgctt 


cacatcccaa 


1080 


cattgacgat 


ttttcctttc 


aatggttaaa 


atatttggaa 


tatctcaaca 


tggatgacaa 


1140 


taatattcca 


agtaccaaaa 


gcaatacctt 


cacgggattg 


gtgagtctga 


agtacctaag 


1200 


tctttccaaa 


actttcacaa 


gtttgcaaac 


tttaacaaat 


gaaacatttg 


tgtcacttgc 


1260 


tcattctccc 


ttgctcactc 


tcaacttaac 


gaaaaatcac 


atctcaaaaa 


tagcaaatgg 


1320 


tactttctct 


tggttaggcc 


aactcaggat 


acttgatctc 


ggccttaatg 


aaattgaaca 


1380 


aaaactcagc 


ggccaggaat 


ggagaggtct 


gagaaatata 


tttgagatct 


acctatccta 


1440 


taacaaatac 


ctccaactgt 


ctaccagttc 


ctttgcattg gtccccagcc 


ttcaaagact 


1500 


gatgctcagg 


agggtggccc 


ttaaaaatgt 


ggatatctcc 


ccttcacctt 


tccgccctct 


1560 
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tcgtaacttg 


accattctgg 


acttaagcaa 


caacaacata 


gccaacataa 


atgaggactt 


1620 


gctggagggt 


cttgagaatc 


tagaaatcct 


ggattttcag 


cacaataact 


tagccaggct 


1680 


ctggaaacgc 


gcaaaccccg 


gtggtcccgt 


taatttcctg 


aaggggctgt 


ctcacctcca 


1740 


catcttgaat 


ttagagtcca 


acggcttaga 


tgaaatccca 


gtcggggttt 


tcaagaactt 


1800 


attcgaacta 


aagagcatca 


atctaggact 


gaataactta 


aacaaacttg 


aaccattcat 


1860 


ttttgatgac 


cagacatctc 


taaggtcact 


gaacctccag 


aagaacctca 


taacatctgt 


1920 


tgagaaggat 


gttttcgggc 


cgccttttca 


aaacctgaac 


agtttagata 


tgcgcttcaa 


1980 


tccgttcgac 


tgcacgtgtg 


aaagtatttc 


ctggtttgtt 


aactggatca 


accagaccca 


2040 


cactaatatc 


tttgagctgt 


ccactcacta 


cctctgtaac 


actccacatc 


attattatgg 


2100 


cttccccctg 


aagcttttcg 


atacatcatc 


ctgtaaagac 


agcgccccct 


ttgaactcct 


2160 


cttcataatc 


agcaccagta 


tgctcctggt 


ttttatactt 


gtggtactgc 


tcattcacat 


2220 


cgagggctgg 


aggatctctt 


tttactggaa 


tgtttcagtg 


catcggattc 


ttggtttcaa 


2280 


ggaaatagac 


acacaggctg 


agcagtttga 


atatacagcc 


tacataattc 


atgcccataa 


2340 


agacagagac 


tgggtctggg 


aacatttctc 


cccaatggaa 


gaacaagacc 


aatctctcaa 


2400 


attttgccta 


gaagaaaggg 


actttgaagc 


aggcgtcctt 


ggacttgaag 


caattgttaa 


2460 


tagcatcaaa 


agaagccgaa 


aaatcatttt 


cgttatcaca 


caccatttat 


taaaagaccc 


2520 


tctgtgcaga 


agattcaagg 


tacatcacgc 


agttcagcaa 


gctattgagc 


aaaatctgga 


2580 


ttcaattata 


ctgatttttc 


tccagaatat 


tccagattat 


aaactaaacc 


atgcactctg 


2640 


tttgcgaaga 


ggaatgttta 


aatctcattg 


catcttgaac 


tggccagttc 


agaaagaacg 


2700 


gataaatgcc 


tttcatcata 


aattgcaagt 


agcacttgga 


tctcggaatt 


cagcacatta 


2760 


aactcatttg 


aagatttgga 


gtcggtaaag 


ggatagatcc 


aatttataaa 


ggtccatcat 


2820 


gaatctaagt 


tttacttgaa 


agttttgtat 


atttatttat 


atgtatiagat 


gatgatatta 


2880 


catcacaatc 


caatctcagt 


tttgaaatat 


ttcggcttat 


ttcattgaca 


tctggtttat 


2940 


tcactccaaa 


taaacacatg 


ggcagttaaa 


aacatcctct 


attaatagat 


tacccattaa 


3000 


ttcttgaggt 


gtatcacagc 


tttaaagggt 


tttaaatatt 


tttatataaa 


taagactgag 


3060 


agttttataa 


atgtaatttt 


ttaaaactcg 


agtcttactg 


tgtagctcag 


aaaggcctgg 


3120 


aaattaatat 


attagagagt 


catgtcttga 


acttatttat 


ctctgcctcc 


ctctgtctcc 


3180 


agagtgttgc 


ttttaagggc 


atgtagcacc 


acacccagct 


atgtacgtgt 


gggattttat 


3240 


aatgctcatt 


tttgagacgt 


ttatagaata 


aaagataatt 


gcttttatgg 


tataaggcta 


3300 
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cttgaggtaa 



3310 



<210> 4 

<211> 905 

<212> PRT 

<213> Mus musculus 

<400> 4 

Met Lys Gly Cys Ser Ser Tyr Leu Met Tyr Ser Phe Gly Gly Leu Leu 
1 5 10 15 



Ser Leu Trp He Leu Leu Val Ser Ser Thr Asn Gin Cys Thr Val Arg 
20 25 30 



Tyr Asn Val Ala Asp Cys Ser His Leu Lys Leu Thr His He Pro Asp 
35 40 45 



Asp Leu Pro Ser Asn He Thr Val Leu Asn Leu Thr His Asn Gin Leu 
50 55 60 



Arg Arg Leu Pro Pro Thr Asn Phe Thr Arg Tyr Ser Gin Leu Ala He 
65 70 75 80 



Leu Asp Ala Gly Phe Asn Ser He Ser Lys Leu Glu Pro Glu Leu Cys 
85 90 95 



Gin He Leu Pro Leu Leu Lys Val Leu Asn Leu Gin His Asn Glu Leu 
100 105 110 



Ser Gin He Ser Asp Gin Thr Phe Val Phe Cys Thr Asn Leu Thr Glu 
115 120 125 



Leu Asp Leu Met Ser Asn Ser He His Lys He Lys Ser Asn Pro Phe 
130 135 140 



Lys Asn Gin Lys Asn Leu He Lys Leu Asp Leu Ser His Asn Gly Leu 
145 150 155 160 



Ser Ser Thr Lys Leu Gly Thr Gly Val Gin Leu Glu Asn Leu Gin Glu 
165 170 175 



Leu Leu Leu Ala Lys Asn Lys lie Leu Ala Leu Arg Ser Glu Glu Leu 
180 185 190 



Glu Phe Leu Gly Asn Ser Ser Leu Arg Lys Leu Asp Leu Ser Ser Asn 
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195 



200 



205 



Pro Leu Lys Glu Phe Ser Pro Gly Cys Phe Gin Thr He Gly Lys Leu 
210 215 220 



Phe Ala Leu Leu Leu Asn Asn Ala Gin Leu Asn Pro His Leu Thr Glu 
225 230 235 240 



Lys Leu Cys Tip Glu Leu Ser Asn Thr Ser lie Gin Asn Leu Ser Leu 
245 250 255 



Ala Asn Asn Gin Leu Leu Ala Thr Ser Glu Ser Thr Phe Ser Gly Leu 
260 265 270 



Lys Trp Thr Asn Leu Thr Gin Leu Asp Leu Ser Tyr Asn Asn Leu His 
275 280 285 



Asp Val Gly Asn Gly Ser Phe Ser Tyr Leu Pro Ser Leu Arg Tyr Leu 
290 295 300 



Ser Leu Glu Tyr Asn Asn He Gin Arg Leu Ser Pro Arg Ser Phe Tyr 
305 310 315 320 



Gly Leu Ser Asn Leu Arg Tyr Leu Ser Leu Lys Arg Ala Phe Thr Lys 
325 330 335 



Gin Ser Val Ser Leu Ala Ser His Pro Asn He Asp Asp Phe Ser Phe 
340 345 350 



Gin Trp Leu Lys Tyr Leu Glu Tyr Leu Asn Met Asp Asp Asn Asn He 
355 360 365 



Pro Ser Thr Lys Ser Asn Thr Phe Thr Gly Leu Val Ser Leu Lys Tyr 
370 375 380 



Leu Ser Leu Ser Lys Thr Phe Thr Ser Leu Gin Thr Leu Thr Asn Glu 
385 390 395 400 



Thr Phe Val Ser Leu Ala His Ser Pro Leu Leu Thr Leu Asn Leu Thr 
405 410 415 



Lys Asn His He Ser Lys He Ala Asn Gly Thr Phe Ser Trp Leu Gly 
420 425 430 
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Gin Leu Arg lie Leu Asp Leu Gly Leu Asn Glu lie Glu Gin Lys Leu 
435 440 445 



Ser Gly Gin Glu Trp Arg Gly Leu Arg Asn He Phe Glu He Tyr Leu 
450 455 460 



Ser Tyr Asn Lys Tyr Leu Gin Leu Ser Thr Ser Ser Phe Ala Leu Val 
465 470 475 480 



Pro Ser Leu Gin Arg Leu Met Leu Arg Arg Val Ala Leu Lys Asn Val 
485 490 495 



Asp He Ser Pro Ser Pro Phe Arg Pro Leu Arg Asn Leu Thr He Leu 
500 505 510 



Asp Leu Ser Asn Asn Asn lie Ala Asn He Asn Glu Asp Leu Leu Glu 
515 520 525 



Gly Leu Glu Asn Leu Glu He Leu Asp Phe Gin His Asn Asn Leu Ala 
530 535 540 



Arg Leu Trp Lys Arg Ala Asn Pro Gly Gly Pro Val Asn Phe Leu Lys 
545 550 555 560 



Gly Leu Ser His Leu His He Leu Asn Leu Glu Ser Asn Gly Leu Asp 
565 570 575 



Glu He Pro Val Gly Val Phe Lys Asn Leu Phe Glu Leu Lys Ser He 
580 585 590 



Asn Leu Gly Leu Asn Asn Leu Asn Lys Leu Glu Pro Phe He Phe Asp 
595 600 605 



Asp Gin Thr Ser Leu Arg Ser Leu Asn Leu Gin Lys Asn Leu He Thr 
610 615 620 



Ser Val Glu Lys Asp Val Phe Gly Pro Pro Phe Gin Asn Leu Asn Ser 
625 630 635 640 



Leu Asp Met Arg Phe Asn Pro Phe Asp Cys Thr Cys Glu Ser He Ser 
645 650 655 



Trp Phe Val Asn Trp He Asn Gin Thr His Thr Asn He Phe Glu Leu 
660 665 670 
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Ser Thr His Tyr Leu Cys Asn Thr Pro His His Tyr Tyr Gly Phe Pro 
675 680 685 



Leu Lys Leu Phe Asp Thr Ser Ser Cys Lys Asp Ser Ala Pro Phe Glu 
690 695 700 



Leu Leu Phe lie lie Ser Thr Ser Met Leu Leu Val Phe lie Leu Val 
705 710 715 720 



Val Leu Leu He His He Glu Gly Trp Arg He Ser Phe Tyr Trp Asn 
725 730 735 



Val Ser Val His Arg He Leu Gly Phe Lys Glu He Asp Thr Gin Ala 
740 745 750 



Glu Gin Phe Glu Tyr Thr Ala Tyr He He His Ala His Lys Asp Arg 
755 760 765 



Asp Trp Val Trp Glu His Phe Ser Pro Met Glu Glu Gin Asp Gin Ser 
770 775 780 



Leu Lys Phe Cys Leu Glu Glu Arg Asp Phe Glu Ala Gly Val Leu Gly 
785 790 795 800 



Leu Glu Ala He Val Asn Ser He Lys Arg Ser Arg Lys He He Phe 
805 810 815 



Val He Thr His His Leu Leu Lys Asp Pro Leu Cys Arg Arg Phe Lys 
820 825 830 



Val His His Ala Val Gin Gin Ala lie Glu Gin Asn Leu Asp Ser He 
835 840 845 



He Leu He Phe Leu Gin Asn He Pro Asp Tyr Lys Leu Asn His Ala 
850 855 860 



Leu Cys Leu Arg Arg Gly Met Phe Lys Ser His Cys lie Leu Asn Trp 
865 870 875 880 



Pro Val Gin Lys Glu Arg He Asn Ala Phe His His Lys Leu Gin Val 
885 890 895 



Ala Leu Gly Ser Arg Asn Ser Ala His 
900 905 
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<210> 5 

<211> 3352 

<212> DNA 

<213> Homo sapiens 

<400> 5 



aggctggtat 


aaaaatctta 


cttcctctat 


tctctgagcc 


gctgctgccc 


ctgtgggaag 


60 


ggacctcgag 


tgtgaagcat 


ccttccctgt 


agctgctgtc 


cagtctgccc 


gccagaccct 


120 


ctggagaagc 


ccctgccccc 


cagcatgggt 


ttctgccgca 


gcgccctgca 


cccgctgtct 


180 


ctcctggtgc 


aggccatcat 


gctggccatg 


accctggccc 


tgggtacctt 


gcctgccttc 


240 


ctaccctgtg 


agctccagcc 


ccacggcctg 


gtgaactgca 


actggctgtt 


cctgaagtct 


300 


gtgccccact 


tctccatggc 


agcaccccgt 


ggcaatgtca 


ccagcctttc 


cttgtcctcc 


360 


aaccgcatcc 


accacctcca 


tgattctgac 


tttgcccacc 


tgcccagcct 


gcggcatctc 


420 


aacctcaagt 


ggaactgccc 


gccggttggc 


ctcagcccca 


tgcacttccc 


ctgccacatg 


480 


accatcgagc 


ccagcacctt 


cttggctgtg 


cccaccctgg 


aagagctaaa 


cctgagctac 


540 


aacaacatca 


tgactgtgcc 


tgcgctgccc 


aaatccctca 


tatccctgtc 


cctcagccat 


600 


accaacatcc 


tgatgctaga 


ctctgccagc 


ctcgccggcc 


tgcatgccct 


gcgcttccta 


660 


ttcatggacg 


gcaactgtta 


ttacaagaac 


ccctgcaggc 


aggcactgga 


ggtggccccg 


720 


ggtgccctcc 


ttggcctggg 


caacctcacc 


cacctgtcac 


tcaagtacaa 


caacctcact 


780 


gtggtgcccc 


gcaacctgcc 


ttccagcctg 


gagtatctgc 


tgttgtccta 


caaccgcatc 


840 


gtcaaactgg 


cgcctgagga 


cctggccaat 


ctgaccgccc 


tgcgtgtgct 


cgatgtgggc 


900 


ggaaattgcc 


gccgctgcga 


ccacgctccc 


aacccctgca 


tggagtgccc 


tcgtcacttc 


960 


ccccagctac 


atcccgatac 


cttcagccac 


ctgagccgtc 


ttgaaggcct 


ggtgttgaag 


1020 


gacagttctc 


tctcctggct 


gaatgccagt 


tggttccgtg 


ggctgggaaa 


cctccgagtg 


1080 


ctggacctga 


gtgagaactt 


cctctacaaa 


tgcatcacta 


aaaccaaggc 


cttccagggc 


1140 


ctaacacagc 


tgcgcaagct 


taacctgtcc 


ttcaattacc 


aaaagagggt 


gtcctttgcc 


1200 


cacctgtctc 


tggccccttc 


cttcgggagc 


ctggtcgccc 


tgaaggagct 


ggacatgcac 


1260 


ggcatcttct 


tccgctcact 


cgatgagacc 


acgctccggc 


cactggcccg 


cctgcccatg 


1320 


ctccagactc 


tgcgtctgca 


gatgaacttc 


atcaaccagg 


cccagctcgg 


catcttcagg 


1380 


gccttccctg 


gcctgcgcta 


cgtggacctg 


tcggacaacc 


gcatcagcgg 


agcttcggag 


1440 


ctgacagcca 


ccatggggga 


ggcagatgga 


ggggagaagg 


tctggctgca 


gcctggggac 


1500 


cttgctccgg 


ccccagtgga 


cactcccagc 


tctgaagact 


tcaggcccaa 


ctgcagcacc 


1560 
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ctcaacttca 


ccttggatct 


gtcacggaac 


aacctggtga 


ccgtgcagcc 


ggagatgttt 


1620 


gcccagctct 


cgcacctgca 


gtgcctgcgc 


ctgagccaca 


actgcatctc 


gcaggcagtc 


1680 


aatggctccc 


agttcctgcc 


gctgaccggt 


ctgcaggtgc 


tagacctgtc 


ccgcaataag 


1740 


ctggacctct 


accacgagca 


ctcattcacg 


gagctaccgc 


gactggaggc 


cctggacctc 


1800 


agctacaaca 


gccagccctt 


tggcatgcag 


ggcgtgggcc 


acaacttcag 


cttcgtggct 


1860 


cacctgcgca 


ccctgcgcca 


cctcagcctg 


gcccacaaca 


acatccacag 


ccaagtgtcc 


1920 


cagcagctct 


gcagtacgtc 


gctgcgggcc 


ctggacttca 


gcggcaatgc 


actgggccat 


1980 


atgtgggccg 


agggagacct 


ctatctgcac 


ttcttccaag 


gcctgagcgg 


tttgatctgg 


2040 


ctggacttgt 


cccagaaccg 


cctgcacacc 


ctcctgcccc 


aaaccctgcg 


caacctcccc 


2100 


aagagcctac 


aggtgctgcg 


tctccgtgac 


aattacctgg 


ccttctttaa 


gtggtggagc 


2160 


ctccacttcc 


tgcccaaact 


ggaagtcctc 


gacctggcag 


gaaaccggct 


gaaggccctg 


2220 


accaatggca 


gcctgcctgc 


tggcacccgg 


ctccggaggc 


tggatgtcag 


ctgcaacagc 


2280 


atcagcttcg 


tggcccccgg 


cttcttttcc 


aaggccaagg 


agctgcgaga 


gctcaacctt 


2340 


agcgccaacg 


ccctcaagac 


agtggaccac 


tcctggtttg 


ggcccctggc 


gagtgccctg 


2400 


caaatactag 


atgtaagcgc 


caaccctctg 


cactgcgcct 


gtggggcggc 


ctttatggac 


2460 


ttcctgctgg 


aggtgcaggc 


tgccgtgccc 


ggtctgccca 


gccgggtgaa 


gtgtggcagt 


2520 


ccgggccagc 


tccagggcct 


cagcatcttt 


gcacaggacc 


tgcgcctctg 


cctggatgag 


2580 


gccctctcct 


gggactgttt 


cgccctctcg 


ctgctggctg 


tggctctggg 


cctgggtgtg 


2640 


cccatgctgc 


atcacctctg 


tggctgggac 


ctctggtact 


gcttccacct 


gtgcctggcc 


2700 


tggcttccct 


ggcgggggcg 


gcaaagtggg 


cgagatgagg 


atgccctgcc 


ctacgatgcc 


2760 


ttcgtggtct 


tcgacaaaac 


gcagagcgca 


gtggcagact 


gggtgtacaa 


cgagcttcgg 


2820 


gggcagctgg 


aggagtgccg 


tgggcgctgg 


gcactccgcc 


tgtgcctgga 


ggaacgcgac 


2880 


tggctgcctg 


gcaaaaccct 


ctttgagaac 


ctgtgggcct 


cggtctatgg 


cagccgcaag 


2940 


acgctgtttg 


tgctggccca 


cacggaccgg 


gtcagtggtc 


tcttgcgcgc 


cagcttcctg 


3000 


ctggcccagc 


agcgcctgct 


qqaqqaccqc 


aaqqacqtcq 


tqgtgctqqt 


gatcctgagc 


3060 


cctgacggcc 


gccgctcccg 


ctacgtgcgg 


ctgcgccagc 


gcctctgccg 


ccagagtgtc 


3120 


ctcctctggc 


cccaccagcc 


cagtggtcag 


cgcagcttct 


gggcccagct 


gggcatggcc 


3180 


ctgaccaggg 


acaaccacca 


cttctataac 


cggaacttct 


gccagggacc 


cacggccgaa 


3240 


tagccgtgag 


ccggaatcct 


gcacggtgcc 


acctccacac 


tcacctcacc 


tctgcctgcc 


3300 
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tggtctgacc ctcccctgct cgcctccctc accccacacc tgacacagag ca 3352 



<210> 6 

<211> 1032 

<212> PRT 

<213> Homo sapiens 

<400> 6 

Met Gly Phe Cys Arg Ser Ala Leu His Pro Leu Ser Leu Leu Val Gin 
15 10 15 



Ala lie Met Leu Ala Met Thr Leu Ala Leu Gly Thr Leu Pro Ala Phe 
20 25 30 



Leu Pro Cys Glu Leu Gin Pro His Gly Leu Val Asn Cys Asn Trp Leu 
35 40 45 



Phe Leu Lys Ser Val Pro His Phe Ser Met Ala Ala Pro Arg Gly Asn 
50 55 60 



Val Thr Ser Leu Ser Leu Ser Ser Asn Arg lie His His Leu His Asp 
65 70 75 80 



Ser Asp Phe Ala His Leu Pro Ser Leu Arg His Leu Asn Leu Lys Trp 
85 90 95 



Asn Cys Pro Pro Val Gly Leu Ser Pro Met His Phe Pro Cys His Met 
100 105 110 



Thr He Glu Pro Ser Thr Phe Leu Ala Val Pro Thr Leu Glu Glu Leu 
115 120 125 



Asn Leu Ser Tyr Asn Asn He Met Thr Val Pro Ala Leu Pro Lys Ser 
130 135 140 



Leu He Ser Leu Ser Leu Ser His Thr Asn He Leu Met Leu Asp Ser 
145 150 155 160 



Ala Ser Leu Ala Gly Leu His Ala Leu Arg Phe Leu Phe Met Asp Gly 
165 170 175 



Asn Cys Tyr Tyr Lys Asn Pro Cys Arg Gin Ala Leu Glu Val Ala Pro 
180 185 190 



Gly Ala Leu Leu Gly Leu Gly Asn Leu Thr His Leu Ser Leu Lys Tyr 
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195 



200 



205 



Asn Asn Leu Thr Val Val Pro Arg Asn Leu Pro Ser Ser Leu Glu Tyr 
210 215 220 



Leu Leu Leu Ser Tyr Asn Arg lie Val Lys Leu Ala Pro Glu Asp Leu 
225 230 235 240 



Ala Asn Leu Thr Ala Leu Arg Val Leu Asp Val Gly Gly Asn Cys Arg 
245 250 255 



Arg Cys Asp His Ala Pro Asn Pro Cys Met Glu Cys Pro Arg His Phe 
260 265 270 



Pro Gin Leu His Pro Asp Thr Phe Ser His Leu Ser Arg Leu Glu Gly 
275 280 285 



Leu Val Leu Lys Asp Ser Ser Leu Ser Trp Leu Asn Ala Ser Trp Phe 
290 295 300 



Arg Gly Leu Gly Asn Leu Arg Val Leu Asp Leu Ser Glu Asn Phe Leu 
305 310 315 320 



Tyr Lys Cys lie Thr Lys Thr Lys Ala Phe Gin Gly Leu Thr Gin Leu 
325 330 335 



Arg Lys Leu Asn Leu Ser Phe Asn Tyr Gin Lys Arg Val Ser Phe Ala 
340 345 350 



His Leu Ser Leu Ala Pro Ser Phe Gly Ser Leu Val Ala Leu Lys Glu 
355 360 365 



Leu Asp Met His Gly lie Phe Phe Arg Ser Leu Asp Glu Thr Thr Leu 
370 375 380 



Arg Pro Leu Ala Arg Leu Pro Met Leu Gin Thr Leu Arg Leu Gin Met 
385 390 395 400 



Asn Phe He Asn Gin Ala Gin Leu Gly He Phe Arg Ala Phe Pro Gly 
405 410 ~ 415 



Leu Arg Tyr Val Asp Leu Ser Asp Asn Arg He Ser Gly Ala Ser Glu 
420 425 430 
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Leu Thr Ala Thr Met Gly Glu Ala Asp Gly Gly Glu Lys Val Trp Leu 
435 440 445 



Gin Pro Gly Asp Leu Ala Pro Ala Pro Val Asp Thr Pro Ser Ser Glu 
450 * 455 460 



Asp Phe Arg Pro Asn Cys Ser Thr Leu Asn Phe Thr Leu Asp Leu Ser 
465 470 475 480 



Arg Asn Asn Leu Val Thr Val Gin Pro Glu Met Phe Ala Gin Leu Ser 
485 490 495 



His Leu Gin Cys Leu Arg Leu Ser His Asn Cys lie Ser Gin Ala Val 
500 505 510 



Asn Gly Ser Gin Phe Leu Pro Leu Thr Gly Leu Gin Val Leu Asp Leu 
515 520 525 



Ser Arg Asn Lys Leu Asp Leu Tyr His Glu His Ser Phe Thr Glu Leu 
530 535 540 



Pro Arg Leu Glu Ala Leu Asp Leu Ser Tyr Asn Ser Gin Pro Phe Gly 
545 550 555 560 



Met Gin Gly Val Gly His Asn Phe Ser Phe Val Ala His Leu Arg Thr 
565 570 575 



Leu Arg His Leu Ser Leu Ala His Asn Asn lie His Ser Gin Val Ser 
580 585 590 



Gin Gin Leu Cys Ser Thr Ser Leu Arg Ala Leu Asp Phe Ser Gly Asn 
595 600 605 



Ala Leu Gly His Met Trp Ala Glu Gly Asp Leu Tyr Leu His Phe Phe 
610 615 620 



Gin Gly Leu Ser Gly Leu lie Trp Leu Asp Leu Ser Gin Asn Arg Leu 
625 630 635 640 



His Thr Leu Leu Pro Gin Thr Leu Arg Asn Leu Pro Lys Ser Leu Gin 
645 650 655 



Val Leu Arg Leu Arg Asp Asn Tyr Leu Ala Phe Phe Lys Trp Trp Ser 
660 665 670 
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Leu His Phe Leu Pro Lys Leu Glu Val Leu Asp Leu Ala Gly Asn Arg 
675 " 680 685 



Leu Lys Ala Leu Thr Asn Gly Ser Leu Pro Ala Gly Thr Arg Leu Arg 
690 695 700 



Arg Leu Asp Val Ser Cys Asn Ser lie Ser Phe Val Ala Pro Gly Phe 
705 710 715 720 



Phe Ser Lys Ala Lys Glu Leu Arg Glu Leu Asn Leu Ser Ala Asn Ala 
725 730 735 



Leu Lys Thr Val Asp His Ser Trp Phe Gly Pro Leu Ala Ser Ala Leu 
740 745 750 



Gin lie Leu Asp Val Ser Ala Asn Pro Leu His Cys Ala Cys Gly Ala 
755 760 765 



Ala Phe Met Asp Phe Leu Leu Glu Val Gin Ala Ala Val Pro Gly Leu 
770 775 780 



Pro Ser Arg Val Lys Cy s Gly Ser Pro Gly Gin Leu Gin Gly Leu Ser 
785 790 795 800 



He Phe Ala Gin Asp Leu Arg Leu Cys Leu Asp Glu Ala Leu Ser Trp 
805 810 815 



Asp Cys Phe Ala Leu Ser Leu Leu Ala Val Ala Leu Gly Leu Gly Val 
820 825 830 



Pro Met Leu His His Leu Cys Gly Trp Asp Leu Trp Tyr Cys Phe His 
835 840 845 



Leu Cys Leu Ala Trp Leu Pro Trp Arg Gly Arg Gin Ser Gly Arg Asp 
850 855 860 



Glu Asp Ala Leu Pro Tyr Asp Ala Phe Val Val Phe Asp Lys Thr Gin 
865 870 875 880 



Ser Ala Val Ala Asp Trp Val Tyr Asn Glu Leu Arg Gly Gin Leu Glu 
885 890 895 



Glu Cys Arg Gly Arg Trp Ala Leu Arg Leu Cys Leu Glu Glu Arg Asp 
900 905 910 
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Trp Leu Pro Gly Lys Thr Leu Phe Glu Asn Leu Trp Ala Ser Val Tyr 
915 920 925 



Gly Ser Arg Lys Thr Leu Phe Val Leu Ala His Thr Asp Arg Val Ser 
930 935 940 



Gly Leu Leu Arg Ala Ser Phe Leu Leu Ala Gin Gin Arg Leu Leu Glu 
945 950 955 960 



Asp Arg Lys Asp Val Val Val Leu Val lie Leu Ser Pro Asp Gly Arg 
965 970 975 



Arg Ser Arg Tyr Val Arg Leu Arg Gin Arg Leu Cys Arg Gin Ser Val 
980 985 990 



Leu Leu Trp Pro His Gin Pro Ser Gly Gin Arg Ser Phe Trp Ala Gin 
995 1000 1005 



Leu Gly Met Ala Leu Thr Arg Asp Asn His His Phe Tyr Asn Arg 
1010 1015 1020 



Asn Phe Cys Gin Gly Pro Thr Ala Glu 
1025 1030 



<210> 7 

<211> 3200 

<212> DNA 

<213> Mus musculus 

<400> 7 



tgtcagaggg 


agcctcggga 


gaatcctcca 


tctcccaaca 


tggttctccg 


tcgaaggact 


60 


ctgcacccct 


tgtccctcct 


ggtacaggct 


gcagtgctgg ctgagactct ggccctgggt 


120 


accctgcctg 


ccttcctacc 


ctgtgagctg 


aagcctcatg gcctggtgga 


ctgcaattgg 


180 


ctgttcctga 


agtctgtacc 


ccgtttctct 


gcggcagcat 


cctgctccaa 


catcacccgc 


240 


ctctccttga 


tctccaaccg 


tatccaccac 


ctgcacaact 


ccgacttcgt 


ccacctgtcc 


300 


aacctgcggc 


agctgaacct 


caagtggaac 


tgtccaccca 


ctggccttag 


ccccctgcac 


360 


ttctcttgcc 


acatgaccat 


tgagcccaga 


accttcctgg 


ctatgcgtac 


actggaggag 


420 


ctgaacctga 


gctataatgg 


tatcaccact 


gtgccccgac 


tgcccagctc 


cctggtgaat 


480 


ctgagcctga 


gccacaccaa 


catcctggtt 


ctagatgcta 


acagcctcgc 


cggcctatac 


540 


agcctgcgcg 


ttctcttcat 


ggacgggaac 


tgctactaca 


agaacccctg 


cacaggagcg 


600 
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gtgaaggtga 


ccccaggcgc 


cctcctgggc 


ctgagcaatc 


tcacccatct 


gtctctgaag 


660 


tataacaacc 


tcacaaaggt 


gccccgccaa 


ctgcccccca 


gcctggagta 


cctcctggtg 


720 


tcctataacc 


tcattgtcaa 


gctggggcct 


gaagacctgg 


ccaatctgac 


ctcccttcga 


780 


gtacttgatg 


tgggtgggaa 


ttgccgtcgc 


tgcgaccatg 


cccccaatcc 


ctgtatagaa 


840 


tgtggccaaa 


agtccctcca 


cctgcaccct 


gagaccttcc 


atcacctgag 


ccatctggaa 


900 


ggcctggtgc 


tgaaggacag 


ctctctccat 


acactgaact 


cttcctggtt 


ccaaggtctg 


960 


gtcaacctct 


cggtgctgga 


cctaagcgag 


aactttctct 


atgaaagcat 


caaccacacc 


1020 


aatgcctttc 


agaacctaac 


ccgcctgcgc 


aagctcaacc 


tgtccttcaa 


ttaccgcaag 


1080 


aaggtatcct 


ttgcccgcct 


ccacctggca 


agttccttca 


agaacctggt 


gtcactgcag 


1140 


gagctgaaca 


tgaacggcat 


cttcttccgc 


tcgctcaaca 


agtacacgct 


cagatggctg 


1200 


gccgatctgc 


ccaaactcca 


cactctgcat 


cttcaaatga 


acttcatcaa 


ccaggcacag 


1260 


ctcagcatct 


ttggtacctt 


ccgagccctt 


cgctttgtgg 


acttgtcaga 


caatcgcatc 


1320 


agtgggcctt 


caacgctgtc 


agaagccacc 


cctgaagagg 


cagatgatgc 


agagcaggag 


1380 


gagctgttgt 


ctgcggatcc 


tcacccagct 


ccactgagca 


cccctgcttc 


taagaacttc 


1440 


atggacaggt 


gtaagaactt 


caagttcacc 


atggacctgt 


ctcggaacaa 


cctggtgact 


1500 


atcaagccag 


agatgtttgt 


caatctctca 


cgcctccagt 


gtcttagcct 


gagccacaac 


1560 


tccattgcac 


aggctgtcaa 


tggctctcag 


ttcctgccgc 


tgactaatct 


gcaggtgctg 


1620 


gacctgtccc 


ataacaaact 


ggacttgtac 


cactggaaat 


cgttcagtga 


gctaccacag 


1680 


ttgcaggccc 


tggacctgag 


ctacaacagc 


cagcccttta 


gcatgaaggg 


tataggccac 


1740 


aatttcagtt 


ttgtggccca 


tctgtccatg 


ctacacagcc 


ttagcctggc 


acacaatgac 


1800 


attcataccc 


gtgtgtcctc 


acatctcaac 


agcaactcag 


tgaggtttct 


tgacttcagc 


1860 


ggcaacggta 


tgggccgcat 


gtgggatgag 


gggggccttt 


atctccattt 


cttccaaggc 


1920 


ctgagtggcc 


tgctgaagct 


ggacctgtct 


caaaataacc 


tgcatatcct 


ccggccccag 


1980 


aaccttgaca 


acctccccaa 


gagcctgaag 


ctgctgagcc 


tccgagacaa 


ctacctatct 


2040 


ttctttaact 


ggaccagtct 


gtccttcctg 


cccaacctgg 


aagtcctaga 


cctggcaggc 


2100 


aaccagctaa 


aggccctgac 


caatggcacc 


ctgcctaatg 


gcaccctcct 


ccagaaactg 


2160 


gatgtcagca 


gcaacagtat 


cgtctctgtg 


gtcccagcct 


tcttcgctct 


ggcggtcgag 


2220 


ctgaaagagg 


tcaacctcag 


ccacaacatt 


ctcaagacgg 


tggatcgctc 


ctggtttggg 


2280 


cccattgtga 


tgaacctgac 


agttctagac 


gtgagaagca 


accctctgca 


ctgtgcctgt 


2340 
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ggggcagcct 


tcgtagactt 


actgttggag 


gtgcagacca 


aggtgcctgg cctggctaat 


2400 


ggtgtgaagt 


gtggcagccc 


cggccagctg 


cagggccgta 


gcatcttcgc 


acaggacctg 


2450 


cggctgtgcc 


tggatgaggt 


cctctcttgg 


gactgctttg 


gcctttcact 


cttggctgtg 


*> e o rt 
252 0 


gccgtgggca 


tggtggtgcc 


tatactgcac 


catctctgcg 


gctgggacgt 


ctggtactgt 


2580 


tttcatctgt 


gcctggcatg 


gctacctttg 


ctggcccgca 


gccgacgcag 


cgcccaagct 


2640 


ctcccctatg 


atgccttcgt 


ggtgttcgat 


aaggcacaga 


gcgcagttgc 


ggactgggtg 


2700 


tataacgagc 


tgcgggtgcg 


gctggaggag 


cggcgcggtc 


gccgagccct 


acgcttgtgt 


2760 


ctggaggacc 


gagattggct 


gcctggccag 


acgctcttcg 


agaacctctg 


ggcttccatc 


2820 


tatgggagcc 


gcaagactct 


atttgtgctg 


gcccacacgg 


accgcgtcag 


tggcctcctg 


2880 


cgcaccagct 


tcctgctggc 


tcagcagcgc 


ctgttggaag 


accgcaagga 


cgtggtggtg 


294 0 


ttggtgatcc 


tgcgtccgga 


tgcccaccgc 


tcccgctatg 


tgcgactgcg 


ccagcgtctc 


3000 


tgccgccaga 


gtgtgctctt 


ctggccccag 


cagcccaacg 


ggcagggggg 


cttctgggcc 


3060 


cagctgagta 


cagccctgac 


tagggacaac 


cgccacttct 


ataaccagaa 


cttctgccgg 


3120 


ggacctacag 


cagaatagct 


cagagcaaca 


gctggaaaca 


gctgcatctt 


catgcctggt 


3180 


tcccgagttg 


ctctgcctgc 










3200 



<210> 8 

<211> 1032 

<212> PRT 

<213> Mus mus cuius 

<400> 8 

Met Val Leu Arg Arg Arg Thr Leu His Pro Leu Ser Leu Leu Val Gin 
15 10 15 



Ala Ala Val Leu Ala Glu Thr Leu Ala Leu Gly Thr Leu Pro Ala Phe 
20 25 30 



Leu Pro Cys Glu Leu Lys Pro His Gly Leu Val Asp Cys Asn Trp Leu 
35 40 45 



Phe Leu Lys Ser Val Pro Arg Phe Ser Ala Ala Ala Ser Cys Ser Asn 
50 55 60 



lie Thr Arg Leu Ser Leu lie Ser Asn Arg lie His His Leu His Asn 
65 70 75 80 



Ser Asp Phe Val His Leu Ser Asn Leu Arg Gin Leu Asn Leu Lys Trp 
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85 



90 



95 



Asn Cys Pro Pro Thr Gly Leu Ser Pro Leu His Phe Ser Cys His Met 
100 105 110 



Thr lie Glu Pro Arg Thr Phe Leu Ala Met Arg Thr Leu Glu Glu Leu 
115 120 125 



Asn Leu Ser Tyr Asn Gly lie Thr Thr Val Pro Arg Leu Pro Ser Ser 
130 135 140 



Leu Val Asn Leu Ser Leu Ser His Thr Asn He Leu Val Leu Asp Ala 
145 150 155 160 



Asn Ser Leu Ala Gly Leu Tyr Ser Leu Arg Val Leu Phe Met Asp Gly 
165 170 175 



Asn Cys Tyr Tyr Lys Asn Pro Cys Thr Gly Ala Val Lys Val Thr Pro 
180 185 190 



Gly Ala Leu Leu Gly Leu Ser Asn Leu Thr His Leu Ser Leu Lys Tyr 
195 200 205 



Asn Asn Leu Thr Lys Val Pro Arg Gin Leu Pro Pro Ser Leu Glu Tyr 
210 215 220 



Leu Leu Val Ser Tyr Asn Leu He Val Lys Leu Gly Pro Glu Asp Leu 
225 230 235 240 



Ala Asn Leu Thr Ser Leu Arg Val Leu Asp Val Gly Gly Asn Cys Arg 
245 250 255 



Arg Cys Asp His Ala Pro Asn Pro Cys He Glu Cys Gly Gin Lys Ser 
260 265 270 



Leu His Leu His Pro Glu Thr Phe His His Leu Ser His Leu Glu Gly 
275 280 285 



Leu Val Leu Lys Asp Ser Ser Leu His Thr Leu Asn Ser Ser Trp Phe 
290 295 300 



Gin Gly Leu Val Asn Leu Ser Val Leu Asp Leu Ser Glu Asn Phe Leu 
305 310 315 320 
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Tyr Glu Ser lie Asn His Thr Asn Ala Phe Gin Asn Leu Thr Arg Leu 
325 330 335 



Arg Lys Leu Asn Leu Ser Phe Asn Tyr Arg Lys Lys Val Ser Phe Ala 
340 345 350 



Arg Leu His Leu Ala Ser Ser Phe Lys Asn Leu Val Ser Leu Gin Glu 
355 360 365 



Leu Asn Met Asn Gly He Phe Phe Arg Ser Leu Asn Lys Tyr Thr Leu 
370 375 380 



Arg Trp Leu Ala Asp Leu Pro Lys Leu His Thr Leu His Leu Gin Met 
385 390 395 400 



Asn Phe He Asn Gin Ala Gin Leu Ser He Phe Gly Thr Phe Arg Ala 
405 410 415 



Leu Arg Phe Val Asp Leu Ser Asp Asn Arg He Ser Gly Pro Ser Thr 
420 425 430 



Leu Ser Glu Ala Thr Pro Glu Glu Ala Asp Asp Ala Glu Gin Glu Glu 
435 440 445 



Leu Leu Ser Ala Asp Pro His Pro Ala Pro Leu Ser Thr Pro Ala Ser 
450 455 460 



Lys Asn Phe Met Asp Arg Cys Lys Asn Phe Lys Phe Thr Met Asp Leu 
465 470 475 480 



Ser Arg Asn Asn Leu Val Thr He Lys Pro Glu Met Phe Val Asn Leu 
485 490 495 



Ser Arg Leu Gin Cys Leu Ser Leu Ser His Asn Ser He Ala Gin Ala 
500 505 510 



Val Asn Gly Ser Gin Phe Leu Pro Leu Thr Asn Leu Gin Val Leu Asp 
515 520 525 



Leu Ser His Asn Lys Leu Asp Leu Tyr His Trp Lys Ser Phe Ser Glu 
530 535 540 



Leu Pro Gin Leu Gin Ala Leu Asp Leu Ser Tyr Asn Ser Gin Pro Phe 
545 550 555 560 
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Ser Met Lys Gly lie Gly His Asn Phe Ser Phe Val Ala His Leu Ser 
565 570 575 



Met Leu His Ser Leu Ser Leu Ala His Asn Asp lie His Thr Arg Val 
580 585 590 



Ser Ser His Leu Asn Ser Asn Ser Val Arg Phe Leu Asp Phe Ser Gly 
595 600 605 



Asn Gly Met Gly Arg Met Trp Asp Glu Gly Gly Leu Tyr Leu His Phe 
610 615 620 



Phe Gin Gly Leu Ser Gly Leu Leu Lys Leu Asp Leu Ser Gin Asn Asn 
625 630 635 640 



Leu His lie Leu Arg Pro Gin Asn Leu Asp Asn Leu Pro Lys Ser Leu 
645 650 655 



Lys Leu Leu Ser Leu Arg Asp Asn Tyr Leu Ser Phe Phe Asn Trp Thr 
660 665 670 



Ser Leu Ser Phe Leu Pro Asn Leu Glu Val Leu Asp Leu Ala Gly Asn 
675 680 685 



Gin Leu Lys Ala Leu Thr Asn Gly Thr Leu Pro Asn Gly Thr Leu Leu 
690 695 700 



Gin Lys Leu Asp Val Ser Ser Asn Ser lie Val Ser Val Val Pro Ala 
705 710 715 720 



Phe Phe Ala Leu Ala Val Glu Leu Lys Glu Val Asn Leu Ser His Asn 
725 730 735 



lie Leu Lys Thr Val Asp Arg Ser Trp Phe Gly Pro lie Val Met Asn 
740 745 750 



Leu Thr Val Leu Asp Val Arg Ser Asn Pro Leu His Cys Ala Cys Gly 
755 760 765 



Ala Ala Phe Val Asp Leu Leu Leu Glu Val Gin Thr Lys Val Pro Gly 
770 775 780 



Leu Ala Asn Gly Val Lys Cys Gly Ser Pro Gly Gin Leu Gin Gly Arg 
785 790 795 800 
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Ser lie Phe Ala Gin Asp Leu Arg Leu Cys Leu Asp Glu Val Leu Ser 
805 810 815 



Trp Asp Cys Phe Gly Leu Ser Leu Leu Ala Val Ala Val Gly Met Val 
820 825 830 



Val Pro lie Leu His His Leu Cys Gly Trp Asp Val Trp Tyr Cys Phe 
835 840 845 



His Leu Cys Leu Ala Trp Leu Pro Leu Leu Ala Arg Ser Arg Arg Ser 
850 855 860 



Ala Gin Ala Leu Pro Tyr Asp Ala Phe Val Val Phe Asp Lys Ala Gin 
865 870 875 880 



Ser Ala Val Ala Asp Trp Val Tyr Asn Glu Leu Arg Val Arg Leu Glu 
885 890 895 



Glu Arg Arg Gly Arg Arg Ala Leu Arg Leu Cys Leu Glu Asp Arg Asp 
900 905 910 



Trp Leu Pro Gly Gin Thr Leu Phe Glu Asn Leu Trp Ala Ser lie Tyr 
915 920 925 



Gly Ser Arg Lys Thr Leu Phe Val Leu Ala His Thr Asp Arg Val Ser 
930 935 940 



Gly Leu Leu Arg Thr Ser Phe Leu Leu Ala Gin Gin Arg Leu Leu Glu 
945 950 955 960 



Asp Arg Lys Asp Val Val Val Leu Val He Leu Arg Pro Asp Ala His 
965 970 " ~ 975 



Arg Ser Arg Tyr Val Arg Leu Arg Gin Arg Leu Cys Arg Gin Ser Val 
980 985 990 



Leu Phe Trp Pro Gin Gin Pro Asn Gly Gin Gly Gly Phe Trp Ala Gin 
995 1000 1005 



Leu Ser Thr Ala Leu Thr Arg Asp Asn Arg His Phe Tyr Asn Gin 
1010 1015 1020 



Asn Phe Cys Arg Gly Pro Thr Ala Glu 
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1025 



1030 



<210> 9 

<211> 42 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 9 

gaaactcgag ccaccatgag acagactttg ccttgtatct ac 42 



<210> 10 

<211> 37 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<210> 11 

<211> 670 

<212> DNA 

<213> Homo sapiens 

<400> 11 

agaaaaattt taaaaaatta ttcattcata tttttaggag ttttgaatga ttggatatgt 60 

aattatattc atattattaa tgtgtatcta tatagatttt tattttgcat atgtactttg 120 

atacaaaatt tacatgaaca aattacacta aaagttattc cacaaatata cttatcaaat 180 

taagttaaat gtcaatagct tttaaactta aattttagtt taacttttct gtcattcttt 240 

actttgaata aaaagagcaa actttgtagt ttttatctgt gaagtagagg tatacgtaat 300 

atacataaat agatatgcca aatctgtgtt attaaaattt catgaagatt tcaattagaa 360 

aaaaatacca taaaaggctt tgagtgcagg tgaaaaatag gcaatgatga aaaaaaatga 420 

aaaacttttt aaacacatgt agagagtgcg taaagaaagc aaaaacagag atagaaagta 480 

caactaggga atttagaaaa tggaaattag tatgttcact atttaagacc tatgcacaga 540 

gcaaagtctt cagaaaacct agaggccgaa gttcaaggtt atccatctca agtagcctag 600 

caatatttgc aacatcccaa tggccctgtc cttttcttta ctgatggccg tgctggtgct 660 

cagctacaaa 670 



<400> 10 

gaaagaattc ttaatgtaca gagtttttgg atccaag 



37 



<210> 12 
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<211> 300 
<212> DNA 
<213> Homo sapiens 



<400> 12 

ttctcaggtc gtttgctttc 


ctttgctttc tcccaagtct 


tgttttacaa 


tttgctttag 


60 


tcattcactg aaactttaaa 


aaacattaga aaacctcaca 


gtttgtaaat 


ctttttccct 


120 


attatatata 


tcataagata 


ggagcttaaa taaagagttt 


tagaaactac 


taaaatgtaa 


180 


atgacatagg 


aaaactgaaa 


gggagaagtg aaagtgggaa 


attcctctga 


atagagagag 


240 


gaccatctca 


tataaatagg 


ccatacccac ggagaaagga 


cattctaact 


gcaacctttc 


300 


<210> 13 

<211> 1031 

<212> DNA 

<213> Homo sapiens 










"* V V «» A J 

agaaggcctt 


acagtgagat 


gggatcccag tatttattga 


gtttcctcat 


tcataaaatg 


60 


gggataataa 


tagtaaatga 


gttgacacgc gctaagacag 


tggaatagtg gctggcacag 


120 


ataagccctc 


ggtaaatggt 


agccaataat gatagagtat 


gctgtaagat 


atctttctct 


180 


ccctctgctt 


ctcaacaagt 


ctctaatcaa ttattccact 


ttataaacaa ggaaatagaa 


240 


ctcaaagaca 


ttaagcactt 


ttcccaaagg tcgcttagca 


agtaaatggg 


agagacccta 


300 


tgaccaggat 


gaaagcaaga 


aattcccaca agaggactca 


ttccaactca 


tatcttgtga 


360 


aaaggttccc 


aatgcccagc 


tcagatcaac tgcctcaatt 


tacagtgtga 


gtgtgctcac 


420 


ctcctttggg 


gactgtatat 


ccagaggacc ctcctcaata 


aaacacttta 


taaataacat 


480 


ccttccatgg 


atgagggaaa 


ggaggtaaga tctgtaatga 


ataagcagga 


actttgaaga 


54 0 


ctcagtgact 


cagtgagtaa 


taaagactca gtgacttctg 


atcctgtcct 


aactgccact 


600 


ccttgttgtc 


cccaagaaag 


cggcttcctg ctctctgagg 


aggacccctt 


ccctggaagg 


660 


taaaactaag 


gatgtcagca 


gagaaatttt tccaccattg 


gtgcttggtc 


aaagaggaaa 


720 


ctgatgagct 


cactctagat 


gagagagcag tgagggagag 


acagagactc gaatttccgg 


780 


aggctatttc 


agttttcttt 


tccgttttgt gcaatttcac 


ttatgatacc 


ggccaatgct 


840 


tggttgctat 


tttggaaact 


ccccttaggg gatgcccctc 


aactggccct 


ataaagggcc 


900 


agcctgagct 


gcagaggatt 


cctgcagagg atcaagacag 


cacgtggacc 


tcgcacagcc 


960 


tctcccacag 


gtaccatgaa 


ggtctccgcg gcagccctcg 


ctgtcatcct 


cattgctact 


1020 


gccctctgcg 


c 








1031 
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<210> 14 

<211> 401 

<212> DNA 

<213> Homo sapiens 

<400> 14 



gatctgtaat 


gaataagcag 


gaactttgaa gactcagtga ctcagtgagt aataaagact 


60 


cagtgacttc 


tgatcctgtc 


ctaactgcca ctccttgttg tcccaagaaa gcggcttcct 


120 


gctctctgag 


gaggacccct 


tccctggaag gtaaaactaa ggatgtcagc agagaaattt 


180 


ttccaccatt 


ggtgcttggt 


caaagaggaa actgatgagc tcactctaga tgagagagca 


240 


gtgagggaga 


gacagagact 


cgaatttccg gagctatttc agttttcttt tccgttttgt 


300 


gcaatttcac 


ttatgatacc 


ggccaatgct tggttgctat tttggaaact ccccttaggg 


360 


gatgcccctc 


aactggccct 


ataaagggcc agcctgagct g 


401 



<210> 15 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 15 

tcgtcgtttt gtcgttttgt cgtt 24 



<210> 16 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 16 

tgctgctttt gtgcttttgt gctt 24 



<210> 17 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 
<220> 

<221> misc_f eature 

<222> (2).. (2) 

<223> n = m5c 
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<220> 

<221> misc_f eature 

<222> (5).. (5) 

<223> n = m5c 



<220> 

<221> misc_f eature 

<222> (13).. (13) 

<223> n = m5c 



<220> 

<221> misc_f eature 

<222> (21) . . (21) 

<223> n = m5c 



<400> 17 

tngtngtttt gtngttttgt ngtt 24 



<210> 18 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 18 

tccatgacgt tcctgatgct 20 



<210> 19 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 19 

tccatgagct tcctgatgct 20 



<210> 20 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 
<220> 

<22l> misc_feature 

<222> (8).. (8) 

<223> n = m5c 
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<400> 20 

tccatgangt tcctgatgct 



20 



<210> 21 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 21 

gcgactggct gcatggcaaa accctctttg 30 



<210> 22 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 22 

caaagagggt tttgccatgc agccagtcgc 30 



<210> 23 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 23 

cgagattggc tgcatggcca gacgctcttc 30 



<210> 24 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<210> 25 

<211> 15 

<212> DNA 

<213> Artificial sequence 
<220> 



<400> 24 

gaagagcgtc tggccatgca gccaatctcg 



30 
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<223> Synthetic oligonucleotide 



<400> 25 
ggcctcagca tcttt 



15 



<210> 26 

<211> 15 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 26 

ggcctatcga ttttt 15 



<210> 27 

<211> 15 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<210> 28 

<211> 15 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 28 

gggttatcga ttaga 15 



<210> 29 

<211> 34 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<400> 27 
gggttcccag tgaga 



15 



<400> 29 

cagctccagg gcctatcgat ttttgcacag gacc 



34 



<210> 30 
<211> 34 
<212> DNA 



<213> Artificial sequence 
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<220> 

<223> Synthetic oligonucleotide 



<400> 30 

ggtcctgtgc aaaaatcgat aggccctgga gctg 



34 



<210> 31 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 31 

tccatgacgt ttttgatgtt , 20 



<210> 32 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<210> 33 

<211> 16 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 33 

tccatgacgt ttttga 16 



<210> 34 

<211> 14 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<400> 32 

tccatgacgt ttttgatg 



18 



<400> 34 
tccatgacgt tttt 



14 



<210> 35 
<211> 20 
<212> DNA 



<213> Artificial sequence 
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<220> 

<223> Synthetic oligonucleotide 



<400> 35 

tccatgacgt ttttgatgtt 



20 



<210> 36 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 36 

tcgtcgtttt gtcgttttgt cgtt 24 



<210> 37 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<210> 38 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<22 3> Synthetic oligonucleotide 

<400> 38 

tccatgacgt ttttgatgtt 20 



<210> 39 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<400> 37 

tcgtcgtttt gtcgttttgt cgtt 



24 



<400> 39 

aagcgaaaat gaaattgact 



20 



<210> 40 
<211> 24 
<212> DNA 
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<213> 



Artificial sequence 



<220> 
<223> 



Synthetic oligonucleotide 



<400> 40 

accatggacg aactgtttcc cctc 



24 



<210> 41 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 41 

accatggacg acctgtttcc cctc 24 



<210> 42 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<210> 43 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 43 

accatggacg atctgtttcc cctc 24 



<210> 44 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<22 3> Synthetic oligonucleotide 



<400> 42 

accatggacg agctgtttcc cctc 



24 



<400> 44 

accatggacg gtctgtttcc cctc 



24 



<210> 45 
<211> 24 



-34- 



WO 03/031573 



PCT/US02/31460 



<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 
<400> 45 

accatggacg tactgtttcc cctc 24 



<210> 46 

<211> 24 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 46 

accatggacg ttctgtttcc cctc 24 



<210> 47 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 47 

agcgggggcg agcgggggcg 20 



<210> 48 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 48 

agctatgacg ttccaagg 18 



<210> 49 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 49 

atcgactctc gagcgttctc 20 



<210> 50 
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<211> 17 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 50 

atgacgttcc tgacgtt 17 



<210> 51 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<210> 52 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 52 

atggaaggtc cagcgttctc 20 



<210> 53 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<210> 54 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 54 

atggaggctc catcgttctc 20 



<400> 51 

atggaaggtc caacgttctc 



20 



<400> 53 

atggactctc cagcgttctc 



20 
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<210> 55 

<211> 15 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 55 

cacgttgagg ggcat 15 



<210> 56 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<210> 57 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 57 

ctgatttccc cgaaatgatg 20 



<210> 58 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 58 

gagaacgatg gaccttccat 20 



<210> 59 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<400> 56 

caggcataac ggttccgtag 



20 



<400> 59 

gagaacgctc cagcactgat 



20 
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<210> 60 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 60 

gagaacgctc gaccttccat 20 



<210> 61 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<210> 62 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 62 

gagaacgctg gaccttccat 20 



<210> 63 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<210> 64 

<211> 15 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 64 

gcatgacgtt gagct 15 



<400> 61 

gagaacgctc gaccttcgat 



20 



<400> 63 

gattgcctga cgtcagagag 



20 
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<210> 65 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 65 

gcggcgggcg gcgcgcgccc 20 

<210> 66 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<22 3> Synthetic oligonucleotide 



<210> 67 

<211> 15 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 67 

gctagacgtt agcgt 15 



<210> 68 

<211> 15 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<210> 69 

<211> 15 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 69 



<400> 66 

gcgtgcgttg tcgttgtcgt t 



21 



<400> 68 
gctagacgtt agtgt 



15 
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gctagatgtt agcgt 



15 



<210> 70 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 70 

gcttgatgac tcagccggaa 20 



<210> 71 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<210> 72 

<211> 19 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 72 

ggggtcaacg ttgacgggg 19 



<210> 73 

<211> 19 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<210> 74 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<400> 71 

ggaatgacgt tccctgtg 



18 



<400> 73 

ggggtcagtc ttgacgggg 



19 
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<400> 74 

gtccatttcc cgtaaatctt 



20 



<210> 75 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 75 

taccgcgtgc gaccctct 18 



<210> 76 

<211> 12 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<210> 77 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 77 

tccacgacgt tttcgacgtt 20 



<210> 78 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<210> 79 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<400> 76 
tcagcgtgcg cc 



12 



<400> 78 

tccataacgt tcctgatgct 



20 



-41 - 
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<400> 79 

tccatagcgt tcctagcgtt 



20 



<210> 80 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 80 

tccatcacgt gcctgatgct 20 



<210> 81 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<210> 82 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 82 

tccatgacgt ccctgatgct 20 



<210> 83 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<210> 84 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 



<400> 81 

tccatgacgg tcctgatgct 



20 



<400> 83 

tccatgacgt gcctgatgct 



20 



-42- 



WO 03/031573 



PCT/US02/31460 



<223> Synthetic oligonucleotide 



<400> 84 

tccatgacgt tcctgacgtt 



20 



<210> 85 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<210> 86 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 86 

tccatgcgtg cgtgcgtttt 20 



<210> 87 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 87 

tccatgcgtt gcgttgcgtt 20 



<210> 88 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<400> 85 

tccatgccgg tcctgatgct 



20 



<400> 88 

tccatggcgg tcctgatgct 



20 



<210> 89 
<211> 20 
<212> DNA 



<213> Artificial sequence 



-43- 



WO 03/031573 
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<220> 

<223> Synthetic oligonucleotide 
<400> 89 

tccatgtcga tcctgatgct 20 



<210> 90 

<211> 20 

<212> DNA 

<213>. Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 90 

tccatgtcgc tcctgatgct 20 



<210> 91 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 91 

tccatgtcgg tcctgatgct 20 



<210> 92 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 92 

tccatgtcgg tcctgctgat 20 



<210> 93 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 93 

tccatgtcgt ccctgatgct 20 



<210> 94 

<211> 20 

<212> DNA 

<213> Artificial 



sequence 



-44- 



WO 03/031573 
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<220> 

<223> Synthetic oligonucleotide 



<400> 94 

tccatgtcgt tcctgatgct 



20 



<210> 95 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 95 

tccatgtcgt tcctgtcgtt 20 



<210> 96 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<210> 97 

<211> 19 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 97 

tcctgacgtt cctgacgtt 19 



<210> 98 

<211> 19 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 



<400> 96 

tccatgtcgt ttttgtcgtt 



20 



<400> 98 

tcctgtcgtt cctgtcgtt 



19 



<210> 99 
<211> 20 
<212> DNA 



-45- 



WO 03/031573 



PCT/US02/31460 



<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 
<400> 99 

tcctgtcgtt ccttgtcgtt 



<210> 100 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 100 

tcctgtcgtt ttttgtcgtt 



<210> 101 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 101 

tccttgtcgt tcctgtcgtt 



<210> 102 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 102 

tcgatcgggg cggggcgagc 



<210> 103 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 103 

tcgtcgctgt ctccgcttct t 



<210> 104 
<211> 27 



20 



20 



20 



21 
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<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 
<400> 104 

tcgtcgctgt ctccgcttct tcttgcc 27 



<210> 105 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<22 3> Synthetic oligonucleotide 

<400> 105 

tcgtcgctgt ctgcccttct t 21 



<210> 106 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 106 

tcgtcgctgt tgtcgtttct t 21 



<210> 107 

<211> 14 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> Synthetic oligonucleotide 
<400> 107 

tcgtcgtcgt cgtt 14 



<210> 108 

<211> 20 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> Synthetic oligonucleotide 
<400> 108 

tcgtcgttgt cgttgtcgtt 20 



<210> 109 



-47- 



WO 03/031573 



PCT/US02/31460 



<211> 22 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 109 

tcgtcgttgt cgttttgtcg tt 22 



<210> 110 

<211> 18 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 110 

tctcccagcg cgcgccat 18 



<210> 111 

<211> 17 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 111 

tctcccagcg ggcgcat 17 



<210> 112 

<211> 18 

<212> DNA 

<213> Artificial sequence 



<220> 

<223> Synthetic oligonucleotide t 
<400> 112 

tctcccagcg tgcgccat 18 



<210> 113 

<211> 20 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 113 

tgcagattgc gcaatctgca 20 



-48- 



WO 03/031573 
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<210> 114 

<211> 13 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 114 

tgtcgttgtc gtt 13 



<210> 115 

<211> 19 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 115 

tgtcgttgtc gttgtcgtt 19 



<210> 116 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 116 

tgtcgttgtc gttgtcgttg tcgtt 25 



<210> 117 

<211> 21 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Synthetic oligonucleotide 

<400> 117 

tgtcgtttgt cgtttgtcgt t 21 
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